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A Set of Ubiquitous Cellular Proteins 
Involved in Viral Life Cycle 

Field of the Invention 

5 This invention relates to newly identified methods for modulating viral RNA 

replication and translation of positive-strand viral RNA, particularly for the prevention 
or treatment of viral infections, especially those infections of humans. 

Background of the Invention 

10 A broad spectrum of viruses belonging to the families Picornaviridae, genera 

Enterovirus, Rhinovirus, Cardiovirus, Aphtovirus and Hepatovirus (hepatitis A virus), 
and Flaviviridae, genera Flavivirus, Pestivirus and Hepacivirus (hepatitis C virus) are 
causative agents of wide-spread human and animal diseases (reviewed in 1, 2). For 
example, pestiviruses such as bovine viral diarrhea virus (BVDV) and classical swine 

15 fever virus (CSFV) are pathogens of ruminants and pigs, which cause heavy losses in 
stock farming. Infection with human Rhinovirus (HRV) represents the main reason for 
the virus-induced common cold in man. Infection with HCV is a major cause of human 
liver disease throughout the world with seroprevalence in the general population 
ranging from 0.3 to 2.2% to as high as -10-20% in Egypt. Neither vaccination 

20 strategies nor efficient treatments could yet be developed for HRV as well as HCV 
infections. Accordingly, a major goal in current research on Picornaviridae and 
Flaviviridae concerns the definition of reasonable targets for antiviral approaches. 

The genome of Picornaviridae and Flaviviridae represents a single-stranded, . 
unsegmented RNA molecule of positive polarity. The genome organization is 

25 monocistronic, which implies that the RNA consists of a single open reading frame 
(ORF) flanked by untranslated regions (UTRs) at the 5' and 3 '-end, respectively. 
Following infection and uncoating, the viral genome operates as a messenger RNA in 
the cytoplasm of the host cell. Translation leads to the synthesis of an unstable 
polyprotein that is co- and post-translationally processed by cellular as well as viral 

30 proteases to give rise to the virus structural and non-structural proteins. The structural 
proteins constitute the virus particle: in the case of Picornaviridae, these concern 
typically four capsid proteins; in the case of Flaviviridae, the virion is composed of a 
capsid and a membrane envelope, the latter which contains two to three membrane- 
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associated viral envelope proteins. The non-structural proteins, which are 
predominantly generated by the activity of well-characterized viral proteases, are 
anticipated or have been demonstrated to act as catalytic components of the viral 
multiplication machinery. Virus-encoded enzymatic functions, beyond that of the viral 
5 proteases, which are essentially involved in the RNA replication process, include an 
RNA helicase and/or a nucleoside triphosphatase and an RNA-dependent RNA 
polymerase (RdRp) activity (Figure 1, see also references 1 and 2). 

Translation of the picornaviral as well as of the pestiviral and hepaciviral 
genomes is controlled by a unique mechanism, which significantly differs from the 

10 typel m7G cap-dependent/ribosome scanning scheme of most eukaryotic messenger 
RNAs. Extensively structured IRES elements which span a major part of the 5'UTR 
and in certain cases also the 5' -part of the ORF promote internal entry of ribosomes, i.e. 
they enable initiation of translation independently of capping and of a free 5'-end (3- 
10). This strategy allows some viruses to induce a general shut-off of the cap- 

15 depending cellular translation while maintaining protein synthesis from their own RNA 
(reviewed in 11). To support internal translation initiation, these viruses use a basic set 
of eukaryotic initiation factors but apply some modifications with respect to common 
mRNAs. Whereas picornaviral IRESes recruit nearly the same set of canonical 
translation initiation factors as capped mRNAs (12, 13), the HCV and pestivirus type 

20 IV IRES elements are capable to form the 40S eIF3 ternary pre-initiation complex 
autonomously (14). Recent data suggest that a network of interactions of tertiary 
structure motifs of the HCV core IRES with the 40S ribosomal subunit facilitates the 
association of the 43S (40S eIF3) particle with the translational start site in the absence 
of canonical translation initiation factors (15). The exact mechanism by which IRES 

25 elements mediate translation initiation remains to be determined. In addition to the 
canonical initiation factors (Ifs), the diverse IRES elements were found to bind other 
cellular proteins, which are suspected or have been shown to enhance translation 
efficiency, to confer tissue specificity or to mediate the regulation between translation 
of the infecting RNA and its replication (see below). In agreement with this concept, 

30 proteins such as La, poly C binding protein (PCBP) or hnRNP E, and poliovirus 
translation factor (PTF), polypyrimidine tract binding protein (PTB) or hnRNP I, have 
been associated with the translation of Enteroviruses; PTB and unr/unrip with HRV; 
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PTB with Aphtoviruses; liver-specific factors, GAPDH (glyceraldehyde 3-phosphate 
dehydrogenase) and PCBP with HAV; and PTB, PCBP, the ribosomal proteins S9 and 
L22, La, and hnRNP protein L with HCV (reviewed in references 16-18, for the non- 
reviewed data see 19-21). 
5 The intracellular multiplication of the viral RNA occurs as a Two-step process, 

the molecular mechanisms of which are far from being understood (see references 1, 2, 
and Figure 1). Although the priming mechanisms to initiate the synthesis of novel 
RNA molecules ought to be different in Picornaviridae and Flaviviridae some general 
homologies exist. RNA replication is known to occur exclusively in the cytoplasm of 

10 the host cell and to proceed asymmetrically along a two-step pathway. Concomitant 
with translation and proteolysis of the polyprotein, a set of non-structural viral proteins 
is presumed to associate with the termini of the genome to form membrane-associated 
replication complexes. The replication complexes initially catalyze transcription of a 
small number of complementary negative-strand RNA intermediates from which, in 

15 turn, an excess of progeny positive-strand RNA molecules are generated. Several lines 
of evidence suggest that Picornavirididae as well as Flaviviridae subvert cellular 
factors (host-factors) to participate as functional components of their replication 
complexes: to confer, for example, template RNA specificity to the RdRp (which is not 
present in vitro) or to mediate the transition between translation and RNA replication. 

20 In this context, cellular factors are discussed, which have been found to interact with 
the 3'UTR of polio virus (nucleolin, see reference 22), flaviviruses (elFla, see reference 
23), or of HCV (PTB, HuR, hnRNP C; see review 18 and references 24, 25). 

As a common feature of the life cycle of all monocistronic positive-strand RNA 
viruses, Figure 1, the viral genome has to exert two essential functions in the cytoplasm 

25 of the infected host-cell. On the one hand, the RNA is translated in 5'-3' direction, on 
the other hand, it acts as a template for the viral RdRp, which is expected to initiate the 
replication cycle at the 3'-end of the genomic RNA moving 3* to 5\ The mechanisms 
of how the RNA switches between both interdependent, although possibly competing 
processes is unknown but they are essential for the regulation of the overall virus life 

30 cycle. Data, which emerged mainly from studies with picornaviruses (reviewed in 
reference 26) suggest the following model. During the mRNA phase, translation 
prevents the initiation of the replication cycle. Then, at a certain stage, the initiation of 
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translation is blocked causing the release of ribosomes from the viral RNA. Finally, 
formation or activation of the initial replication complex <c locks" the viral RNA into a 
replication mode and promotes the synthesis of negative-strand RNA. A reasonable 
model to explain the transition from translation to RNA replication, and possibly vice 
5 versa, is a feedback communication of the UTRs of the viral genome involving the viral 
replication complex on the one hand and cellular host factors on the other hand. The 
latter proteins are expected to be associated with the translation machinery but to 
interact also with viral proteins and/or regulatory elements of the viral RNA. Such a 
model suggests a functional cross-talking between 5' and the 3'-end of the viral RNA - 
10 similarly as it has been proposed during translation regulation of capped eukaryotic 
mRNAs (27). 

Preliminary data from the poliovirus system suggest that translation takes place 
until an adequate quantity of the viral polypetide 3CD pro is accumulated (28). Aided by 
the host factor PCBP1, 302™ then interacts with a certain RNA-structure, cloverleaf, 

15 at the immediate 5'-end of the genome. This motif is essentially involved in both steps 
of the RNA replication process. Moreover, it modulates the IRES-mediated translation 
process (29). The viral/cellular ribonucleoprotein (RNP) complex is suspected to 
repress translation and to promote negative-strand RNA synthesis (28). Interestingly, 
3CDP™ was shown to associate with poly(A) + binding protein (pAblp) (30). As a 

20 possible scenario, pABlp might contact an A-rich region in the 3'UTR and thus could 
bring about a functional 5* -3' interaction of the poliovirus genome. Data obtained with 
atomic force microscopy indicate indeed a closed loop conformation of the poliovirus 
genome (31). Indications for a 5'-3' communication of the viral genome exist also for 
the flavivirus Kunjin and hepatitis C virus (32, 33). 
• 25 The identification of cellular factors or vRbps, which are critical for the 

intracellular multiplication process of RNA viruses, and the characterization of the 
functional interplay between these factors with viral proteins and genomic elements of 
the viral RNA are key to understanding replication of these viruses. Inhibiting the 
biological activity of such factors may potentially benefit cells by controlling, reducing 

30 and alleviating diseases caused by infection with these viruses. 

Many viruses encode protein factors to circumvent the antiviral response of the 
cellular host to an infection. Along this line, certain viral proteins such as the vaccinia 
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E3L, the influenza virus protein NS1 and the rotavirus NSP3 associate with double- 
stranded (ds) RN A, and bind to dsRNA-dependent protein kinase (PKR) in order to inhibit 
its antiviral activity (reviewed in reference 34). PKR, the expression of which is 
induced by dsRNA and/or the activity of interferons, e.g., as a result of a viral infection, 
5 is a serine/threonine kinase with multiple functions in control of transcription and 
translation (reviewed in reference 35). The enzyme, which is activated through its 
binding to dsRNA, plays a role in mediating apoptosis as well as signal transduction 
events that are involved in the interferon response of the cell to accelerate virus 
clearance. Moreover, the activated PKR phosphorylates the a subunit of the eukaryotic 
10 translation initiation factor eIF2. Phosphorylation of eIF2a inhibits the recycling of 
eIF2 and consequently blocks the cellular translation machinery in response to viral 
infection. Accordingly, proteins, which mimic the PKR-eIF2a interaction domain, 
were found to inhibit the activity of PKR (34). 



5 
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Summary of the Invention 

The invention relates to a set of cellular polypeptides, their production and uses, 
as well as variants, agonists and antagonists and their uses. In particular, in these and 
5 in other regards, the invention relates to a set of cellular polypeptides, hereinafter 
referred to as viral RNA binding proteins (vRbp). The set of cellular polypeptides 
preferably associate with the untranslated regions of the genomes of different 
representatives of virus families, preferably, the Picornaviridae and Flaviviridae 
families. The experimental data obtained with the Flaviviridae members BVDV and 

10 HCV implicate these proteins are involved in the regulation of the translation and 
replication process of the viral RNA. Preferably, they may be crucially involved in the 
regulation of the translation and replication process of the viral RNA. Remarkably, the 
majority of these cellular polypeptides represent dsRNA binding proteins, which may 
associate with PKR and thus inhibit its activity. Therefore, the recruitment of these 

15 factors by the diverse viral RNAs may serve a second purpose, Le., to block the 
antiviral activity of PKR in the host cell. The newly identified viral/cellular 
ribonucleoprotein (RNP) complex is accordingly expected to represent a meaningful 
target for antiviral substances that are either capable to interfere directly with the viral 
multiplication process or to increase the efficiency of the endogenous antiviral 

20 response. 

One aspect of the invention is a method for modulating viral RNA replication and 
translation, in a eukaryotic cell, of positive-strand viral RNA, comprising the step of 
contacting a viral RNA-binding protein (vRbp) with a compound that modulates an 
activity of said vRbp. Preferred aspects of this method include vRbps selected from the 

25 group consisting of: vRbpl30, vRbpl20, vRbpllO, vRbp84, vRbp64, and vRbp45. In 
other alternative methods, the activity of the vRbp is selected from the group consisting 
of a response to viral RNA, interferon induction, double-stranded RNA-dependent 
protein kinase (PKR), and to another vRbp. Furthermore, other embodiments of the 
claimed invention include a response to the formation of a viral:ceUular 

30 ribonucleoprotein (RNP) complex. Alternative RNP complexes include a viral 
RNA:vRbp interaction, binding of a vRbp to a viral RNA 3 9 untranslated region (3TJTR) 
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or binding of a vRbp to a viral RNA 5* untranslated region (5TJTR). Another 
embodiment of the invention is wherein the 3TJTR is a UGA box consensus sequence. 

In still another aspect of the invention, methods for modulating viral RNA replication 
and translation include modulating the activity of a vRbp wherein the activity is a 
5 response to viral RNA circularization. In one aspect of the invention includes modulating 
the binding of vRbp td the viral S'XJTR and 3'UTR, which creates a physical and 
functional link between both ends of the RNA. A preferred embodiment of the 
invention provides for a method, of modulation an interaction between viral 5T7TR, 
3TJTR RNA, vRbp, and cellular proteins involved in the interferon antiviral response. 

10 In yet another aspect of the invention, methods for modulating viral RNA replication 
and translation include modulating the activity of a vRbp wherein the activity is a 
response to an increase in translational frameshifting that result in decreased viral 
replication, or formation of a vRbp:PKR interaction. 

Other embodiments of the invention include methods for modulating viral RNA 

15 replication and translation wherein viral replication and translation comprises 
coordinated regulation of replication and translation of viral RNA. 

Alternative embodiments include methods for modulating viral RNA replication and 
translation wherein the eukaryotic cell is, but not limited to, a mammalian cell, a 
human cell, or a liver cell. 

20 Alternative embodiments include methods for modulating viral RNA replication and 
translation wherein viral RNA is positive strand viral RNA from viral families including 
Flaviviridae and Picornaviridae. 

Other aspects of the present invention include compounds for modulating viral RNA 
replication and translation. Alternative embodiments include therapeutically effective 

25 amounts of viral 3'UTR, fragments thereof, or pharmaceutical^ acceptable derivatives 
thereof for modulating viral RNA replication and translation. Further embodiments of 
the invention include methods for reducing vRbp activity by interfering with the 
interaction between vRbp and vRbp recognition sites on viral RNA. One embodiment 
that reduces vRbp activity is by modification of a viral 3TJTR, which modification 

30 otherwise reduces vRbp binding to vRbp recognition sites on viral RNA. Another 
embodiment that reduces vRbp activity is by inhibiting dissociation of viral RNA:vRbp 
complexes. 
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In another aspects of the invention, method for reducing the effects of viral 
infection on eukaryotic cells, comprising inhibiting vRbp activity in the cell such that 
viral replication and translation of viral RNA is regulated by interactions between vRbp 
and said viral RNA, comprising introducing a nucleic acid decoy molecule into the cell 
5 in an amount sufficient to inhibit viral RNArvRbp interactions, which decoy includes a 
vRbp recognition site that binds to vRbp. Alternative methods for reducing the effects 
of viral infection on eukaryotic cells, include inhibiting vRbp activity in the cell such 
that viral replication and translation of viral RNA is regulated by interactions between 
vRbp and PKR, comprising introducing a nucleic acid decoy molecule into the cell in 

10 an amount sufficient to inhibit vRbp:PKR interactions, which decoy includes a vRbp 
recognition site that binds to vRbp. 

Additional aspects of the invention include methods for reducing the effects of viral 
infection on eukaryotic cells, comprising the step of reducing vRbp activity in the cell 
such that viral replication and translation is reduced. Prefered embodiments include 

15 methods for reducing the effects of viral infection on eukaryotic cells, the method 
comprising the step of reducing vRbp activity in the cell such that production of novel 
infectious virus particles is reduced, steps of reducing vRbp activity in the cell to 
inhibit the spread of virus in infected individuals and animals, steps of reducing vRbp 
activity in the cell to prevent the spread of virus between different individuals and 

20 animals, or steps of reducing vRbp activity in the cell to treat syndromes caused by co- 
infection of different viruses, such as, HCV and HBV or HCV and HIV. Othere 
alternatives to methods reducing the effects of viral infection on eukaryotic cells, 
include steps of reducing vRbp activity in the cell to treat before, during, and after a 
transplantation, steps of modulating vRbp activity in the cell to treat immunosuppressed 

25 patients to prevent virus infections. 

Another aspect of the invention includes a method for reducing the effects of viral 
infection, in a eukaryotic cell, by modulating vRbp activity in the cell, the method 
comprising the step of interfering with viral translation termination as a mechanism to 
disrupt viral replication. Furthermore, an alternative method of the invention for 

30 reducing the effects of viral infection, in a eukaryotic cell, is to modulate viral RNA- 
binding protein (vRbp) activity in the cell, the method comprising the step of 
interfering with interactions between viral 3TJTR and 5UTR, or interactions between 
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structural elements within the 3TJTR and NS5B stop codon as a mechanism to regulate 
translation termination, translational frameshifting, and the coordinated balance of 
replication and translation on positive strand RNA, such as RNA from a member of the 
family Flaviviridae, or Picornayiridae. 
5 Other embodiments of the invention include a method of treating or preventing a 
viral infection by a virus comprising the step of administering a therapeutically effective 
amount of a compound to an individual suspected of having or being at risk of having 
an infection with a virus, such as, hepatitis A virus (HAV), hepatitis C virus (HCV), 
human Rhinovirus (HRV), bovine viral diarrhea virus (BVDV), and classical swine fever 

10 virus (CSFV). An embodiment of the claimed compound may compound interact with 
viral genomic 3TJTR or 5UTR RNA. Alternative aspects of the invention include 
methods for modulating the function of a viral 3UTR comprising the step of contacting 
a 3TJTR with a compound that modulates the structure of the 3TJTR as to inhibit the 
interaction between 3TJTR and vRbp. 

15 Another aspect of the invention is a method for screening to identify compounds that 
activate or that inhibit the function of vRbp which comprises a method selected from the 
group consisting of: 

(a) mixing a candidate compound with a solution containing a vRbp, to form a 
mixture, measuring activity of the vRbp in the mixture, and comparing the 

20 activity of the mixture to a standard; 

(b) detecting the effect of a candidate compound on the production of viral 
RNA in a eukaryotic cell, using for instance, an ELISA assay, reticulocyte 
lysate translation assay (luciferase RNA); and 

(c) (1) contacting a composition comprising the vRbp with the compound to 
25 be screened under conditions to permit interaction between the compound and the 

vRbp to assess the interaction of a compound, such interaction being associated 
with a second component capable of providing a detectable signal in response to 
the interaction of the vRbp with the compound; and 

(2) determining whether the compound interacts with and activates or 
30 inhibits an activity of the vRbp by detecting the presence or absence of a signal 

generated from the interaction of the compound with the vRbp. 
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An alternative embodimejit of the invention is a method for screening to identify 
compounds that increase translational frameshifting resulting in decreased replication 
of viral RNA comprising a method selected from the group consisting of: 

(a) mixing a candidate compound with a solution containing a vRbp, to form a 
5 mixture, measuring activity of the vRbp in the mixture, and comparing the 

activity of the mixture to a standard; and 

(b) detecting the effect of a candidate compound on the production of viral 
RNA in a eukaryotic cell, using for instance, an ELISA assay, reticulocyte 
lysate translation assay (luciferase RNA). 

10 Other aspects and advantages of the present invention are described further in the 
following detailed description of the preferred embodiments thereof. ' 

Brief Description of the Figures 

Figure 1 graphically illustrates the genome organization and replication cycle of 

15 Picornaviridae, Pestiviruses and Hepaciviruses. (A) Schematic representation of the 
organization of Picornaviridae, Pestiviruses and Hepatitis C virus genomes. The 5' 
and 3'untranslated regions (UTRs) are indicated as black lines, the protein-coding 
region (ORF) as a box. The proteolytic cleavage products of the ORF-encoded 
polyprotein are shown as differently shaded regions. The dot at the 5 '-end of the 

20 Picornaviridae genome indicates the VPg protein (or 3B protein), which is associated 
to the 5'-end of all Picornaviridae RNAs. L specifies a leader protein found in 
cardioviruses, Theiler viruses and aphtoviruses; it is not present in enteroviruses, 
human rhinovirus, or human hepatitis A virus. 1A-1D represent the Picornaviridae 
capsid proteins. C, E™ 3 , El and E2 are the structural components of the Pestivirus 

25 virion. C, El and E2 are the structural components of the Hepaciviruses. Note that 
Picornaviruses have different internal ribosomal entry sites (types I-III). The IRES of 
Pestiviruses and Hepaciviruses was termed as type IV. (B) Schematic representation of 
the replication pathway of monocistronic .RNA viruses. Upper level: general 
organization of the genome of monocistronic positive-strand RNA viruses (see A). The 

30 5'-end may be either capped (as with Havi viruses) or it may contain an IRES region, 
the 3'UTR may be polyadenylated or not. For a detailed description of the replication 
scheme see text or references 1 and 2. 

10 
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Figure 2 graphically illustrates organization of monocistronic and bicistronic BVDV 
and HCV RNA replicons. Top: organization of subgenomic BVDV replicon RNAs in 
comparison with the full-length viral genome. In the case of the monocistronic BVDV 
5 replicon "DI9c," the coding region of the pestiviral protein N 1 * 0 is directly fused to the 
NS3 coding region. N 1 " 0 is, an autoprotease and enables the generation of the NS3 
protein with its authentic N-terminus. DI9c or functional parts of it have been used in 
most experiments, which .were aimed at characterizing the different functional 
determinants of the translation and replication process of the BVDV RNA (see text). 

10 'Bicistronic replicons" contain an additional, heterologous ORF. The additional gene 
may encode a resistance-marker (Hyg=hygromycine B phosphotransferase; 
Neo=neomycin phosphotransferase) or other enzymes (e.g. GUS=6-glucoronidase). 
The additional ORF was cloned upstream of an encephalomyocarditis (EMCV) IRES- 
element, the latter which maintains expression of the viral non-structural proteins. 

15 Generation of the authentic N-terminus of the heterologous gene product was enabled 
by fusing a portion of the N 1 * 0 gene, this is necessary to ascertain efficient IRES 
function, and an ubiquitine gene to the S'-tenninus of the additional ORF. Generation 
of the authentic N-terminus of the heterologous protein is thus enabled by the activity 
of cellular ubiquitin C-terminal hydrolases. Two types of BVDV replicons were 

20 employed in our assay-systems, namely ncp and cp types. Ncp implies that these 
RNase are non-cytopathic and hence persist in the transfected host-cell. These RNAs 
express predominantly the full-length NS2-3 protein. Generation of the authentic N- 
tenninus of NS2-3 is enabled by cellular peptidases, which cleave at the C-terminus of 
the peptide p7, which is also encoded by the ORF. Cp indicates cytopathogenicity, Le., 

25 lysis of the host-cell at a certain time post transfection. A cytopathogenic phenotype 
correlates with the predominant expression of NS3 (2). Accordingly, DI9c represents a 
cp replicon RNA. Bottom: organization of mono and bicistronic HCV replicons. The 
organization basically resembles to that of the BVDV replicons described above. AC 
indicates a short region of the Core protein-coding region, which was shown to be 

30 important for efficient translation initiation. In certain cases, a ubiquitine gene was 
inserted. 

11 
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. Abbreviations: mono-monocistronic, bi-bicistronic, cp-cytopathic, ncp-non cytopathic, 
A-indicates an incomplete genetic unit, ubi-indicates the ubiquitine gene to mediate 
proteolytic cleavage by ubiquitine carboxy-terminal hydrolases at this position), het. 
gene-indicates a gene encoding a heterologous protein. The proteolytic cleavage sites 
5 are indicated as follows: arrow-cleavage by NS3/NS4A, circle-cleavage by cellular 
signalases, A-autoproteolytic activity, ?-uncertain. 

Figure 3 graphically illustrates RNA secondary structure of the 3'UTRs of a BVDV 
(strain CP7/CP9; see 36 and references herein) and of an HCV isolate (strain IB; 38). 

10 The depicted sequence initiates with the translational UGA stop-codon (indicated by 
italics). The structure of the BVDV 3'UTR was determined by experimental means 
(43): nucleotide residues that were found to be exposed to RNases or chemical 
modification are indicated in dark grey (highly exposed) or light grey (less exposed). 
The UGA box elements and pseudo-stops are boxed. The arrow marks the border 

15 between the 3'V and 3'C regions as proposed by Deng and Brock (47). The RNA 
secondary structure of the HCV 3'UTR was calculated with the mfold 3.1 computer 
program. 

Figure 4 graphically illustrates (A) Secondary structure of the 5*UTRs of BVDV and 
20 HCV (reviewed in reference 16). The diverse RNA domains and the AUG translational 
start-codon are indicated. The minimal IRES elements are boxed, the so called "core- 
domains" are marked by dashed circles. HCV 5'UTR: the arrows indicate regions, 
which were found to harbour important replication signals (52, 44). (B) Structure and 
functions of the BVDV hairpin la and "hairpin lb" motifs. The structures of la and lb 
25 were determined by Yu et al. (43, 45): residues that were found to be exposed to 
RNases or chemical modification are indicated as in Fig. 3. Hairpin lb is written in 
quotation marks, because the experimental data contradict the formation of a hairpin 
structure. Nucleotides that are essential for replication are boxed; elements that 
enhance the replication efficiency are indicated by dashed boxes. Elements that 
30 enhance the translation efficiency are indicated by a dashed circle (43, 45). 
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Figure 5 graphically illustrates (A) A set of cellular proteins binds to the 3'UTR of the 
BVDV DI9c replicon RNA. UV cross-linking/label transfer experiments were 
performed with viral and non-viral RNA probes and cytoplasmic extracts of BHK-21 
cells. The composition of the utilized RNA probes is schematized in the lower part of 
5 the figure. Indicated are the restriction sites that were used to generate the respective 
templates for run-off transcription, and, in the case of the viral RNAs (3'BVDV and 
3'HCV), the translational stop-codon. A grey box depicts, the non-related BKS RNA; 
open boxes correspond to the untranslated regions of the viral RNAs, black boxes stand 
for residual parts of the viral ORF. Cytoplasmic extracts (total amount of protein: ca. 

10 20 fig/assay volume) of mock-transfected, lanes 1, 3, 5 and 7, or BVDV DI9c 
transfected BHK-21 cells, lanes 2, 4, 6 and 8, were utilized for cross-linking with the 
different [ 32 P] UTP-labeled RNA transcripts. Protein labeling was analyzed by 10% 
SDS-PAGE. In the control-reactions shown in lane 7 and 8, RNA-protein complexes 
formed on radio-labeled 3'BVDV RNA were digested with proteinase K prior to 

15 exposure to UV-light. Marker proteins are indicated on the left; the most significantly 
RNA-charged proteins, marked by arrows, were denoted according to their suggested 
molecular weights, namely pl30, pl20, pi 10, p84, p67 p64, and p45 (termed as 
"vRbps" in the text). (B) The same set of RNA-binding proteins is present in various 
cell types and interacts with the 3'UTR of different pestiviruses. Top: cross-linking 

20 study with labeled BVDV DI9c 3'UTR and cytoplasmic extracts of BHK-21, MDBK 
and HeLa S3 cells. UV cross-linking/label transfer was performed with BKS RNA, 
lanes 1 to 3, or 3'BVDV RNA, lanes 4 to 6), respectively. Positions of the most 
strikingly labeled proteins are indicated by arrows, see Fig. 5A. By competition 
experiments, see Fig. 5C, the protein marked with an asterisk, lane 6, was demonstrated 

25 to bind non-specifically to the viral RNA (data not shown). Bottom: cross-linking 
study with labeled BVDV or CSFV 3'UTR RNA using cytoplasmic extract of BHK-21 
cells (similar results were obtained with extracts of other cell types, data not shown). 
The composition of the different RNA probes is schematized in the lower part of the 
figure. Lane 1, control assay with BKS RNA; lane 2, cross-link assay with 3'BVDV 

30 RNA; lane 3, ~ with 3'CSFV RNA; lane 4, control reaction with 3'BVDV RNA, 
performed as described in Fig. 5A. Molecular weights are indicated on the left. 
Arrows point at the major RNA-protein complexes. (C) Cellular proteins pl30, pl20, 



WO 2004/029199 



PCT/US2003/028654 



pi 10, p84, p64, and p45 bind in a specific manner to the pestiviral 3'UTR. Aliquots of 
cytoplasmic extract of BHK-21 cells, approximately 10 \xg of total protein/assay, were 
incubated with either [ 32 P]-labeled BKS RNA probe, lanes 1 to 3, or 3'BVDV RNA, 
lanes 4 to 8, in the absence or presence of the below-indicated amounts of unlabeled 
5 competitor RNA, respectively. After treatment with UV-light, the proteins were 
analyzed on SDS-PAGE. Lane 1 and 4, assay without competitor; lane 2 and 5, 
identical experiment performed as in lane 1 but in the presence of a 200 fold molar 
excess of non-specific BKS competitor RNA; lane 3 and 6, - in the presence of a 200 
fold molar excess of specific 3'BVDV competitor RNA; lane 7, ~ in the presence of a 

10 200 fold molar excess of 3'CSFV RNA; lane 8, ~ in the presence of a 200 fold molar 
excess of 3'HCV RNA. As in the previous figures, the molecular masses of the 
radiolabeled ribonucleoprotein complexes are indicated by arrows. (D) Exploring the 
BVDV DI9c 3'UTR for the host factor binding site(s). Comparison of 3'V regions of 
different pestivirus genotypes, sequences obtained from Genbank databases, revealed 

15 the conservation of stretches of 12 nucleotides. The nucleotide sequences of 
representatives of the different pestivirus genotypes (BVDV-1, BVDV-2, CSFV and 
BDV) were extracted from the GenBank/EMBL database and computer-aligned. An 
A/U-rich sequence element, which is present in all different viral genomes was found to 
be located at either position 43 or 46 of the respective 3'UTR; it was termed 

20 UGAposxons. box (nomenclature as in Fig. 3). Interestingly, most of these motifs (only 
exception BVDV NADL) are positioned "in frame" with the viral ORF. Hence, the 
distance of the UGAposxras. box with regard to the translational stop codon corresponds 
to 14 or 15 triplet-units, "pseudo codons," respectively. As indicated by the consensus 
sequence shown in the lower part of the figure, the UGA pos .cons. boxes contain 4 

25 nucleotides that are 100% conserved, (bold typed and underlined) among all different 
viral genomes. These nucleotides are also conserved in other, "additional" UGA boxes 
such as those of BVDV Osloss, BVDV CP7 (BVDV DI9c) and BVDV Singer at 
position 16 or 19 of the respective 3'UTR. Note that most UGA boxes contain "pseudo 
stop-codons" such as UAA at their 3'-end. (E) pl30, pl20, pi 10, p84, p64, and p45 

30 bind specifically to a single UGA box sequence motif. Left: UV-induced label-transfer 
experiments with cytoplasmic extracts of BHK-21 cells and RNA probes containing 
defined parts of the 3'V region of BVDV DI9c RNA. The composition of the applied 
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RNA transcripts is schematically drawn in the lower part of the figure: Bml/m2 RNA 
covers the S'-terminal part of the BVDV DI9c 3'V region (residues 10-63 in the 
numbering scheme of Fig. 3) and thus includes the 5'UGA box and the UGApo S .cons. 
box, depicted as grey boxes. Bm2 RNA consists mainly of the UGApos.cons. box 
5 sequence, grey box, of the BVDV DI9c RNA (residues 40-62 in the nomenclature of 
Fig. 3). To allow an estimation of the binding capacity of the different RNAs identical 
molar amounts of Bml/m2 RNA and Bm2 RNA were employed in the UV cross- 
linking assay. Lane 1, negative control assay with non-related BKS RNA; lane 2, 
positive control assay with 3'BVDV RNA; lane 3, assay with Bral/m2 RNA; lane 4, 

10 assay with Bm2 RNA. Molecular weights and positions of the RNA-charged proteins 
are indicated as in all previous figures. Proteins* which were found to bind non- 
specifically to the RNA transcripts, data not shown, are marked with asterisks. Right: 
competition experiments with Bml/m2 RNA and Bm2 RNA. Competition of the 
binding of the cellular proteins to 3'BVDV RNA, lanes 1-5, or Bml/m2 RNA, lanes 6- 

15 10, was investigated by using BKS RNA as a non-specific competitor, lanes 2 and 7, 
and 3'BVDV RNA, lanes 3 and 8, Bml/m2 RNA, lanes 4 and 9, and Bm2 RNA, lanes 
5 and 10, as specific competitors, respectively. To allow an estimation of the 
competition efficiency of each of the different RNAs, identical molar amounts were 
included into the respective experiments. Molecular weight markers and positions of 

20 RNA-protein complexes are indicated as in the previous figures. 

Figure 6 graphically illustrates Binding of the vRbps to the BVDV 3'V region 
correlates with the efficiency of translation initation, translation termination, and 
replication of the viral RNA. (A) RNA secondary structure of the wt BVDV 3*UTR 

25 and of two 3'V mutants. The RNA structure was determined by experimental probing 
(see Fig. 3). Mutant 1 comprised a, deletion of 57 residues, le. t of both 5'-terminal 
UGA boxes, and a double point-mutation affecting the 3'UGA-like box and the folding 
of SLII, respectively. Mutant 2 comprised nine point mutations that modified the 
consensus of all three UGA-boxes, the pseudo-stops and the folding of SLstop and SIM, 

30 respectively. (B) Effect of mutagenesis on the association of the viral RNA binding 
proteins to the BVDV S'UTR. Wt and mutant 3'UTRs were tested by UV 
crosslinking/label transfer for the association of host-factors pl30, pl20, pi 10, p84, 

15 
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, p67, and p64, respectively. As shown, both mutant RNAs associate the cellular 
proteins to a significantly lower degree with respect to the wild-type RNA, for further 
details, see Fig. 5 and text. (C) Effect of mutagenesis on the rate of replication and 
translation of the viral RNA. Replication was determined with monocistronic BVDV 
5 constructs (Fig. 2) via quantitative RNase protection of progeny positive-strand RNA. 
Translation was quantitated in vitro by the expression of the N pro protein essentially as 
described by Yu et al. (43). (D) Translational read-through assay. In vitro translation 
was performed in the presence of [ 35 S] methionine with a minigenomic RNA encoding 
the UTRs and a shortened ORF (53). With the wt, only the ORF-encoded proteins, NS 
10 fus and N pro , the latter which is autoproteolytically released, are expressed. An 
additional product corresponding in size exactly to the NS fus + translated 3'UTR is 
detectable with both 3'V mutants (53). 

Figure 7 graphically illustrates data that support the idea of a protein-mediated 
15 interaction of the termini of the BVDV RNA. (A) UV crosslinking/label transfer 
experiments with transcripts of the BVDV 5'UTR, HCV 5'UTR and BVDV 3'UTR. 
The proteins, which were confirmed to associate specifically to the viral RNAs, see Fig. 
5, are indicated as in the previous figures. Asterisks mark proteins found to bind non- 
specifically. Polypyrimidine-tract binding protein (PTB) is indicated, which was 
20 previously shown by the same assay to bind to the HCV 5'UTR. (B) 5'-3' co- 
precipitation assay. The experiment was generally performed with a biotinylated 
5'UTR transcript and a [ 32 P]-labelled 3'UTR transcript. Precipitation was performed 
with streptavidine-beads. As a control, the precipitation was carried out in the absence 
of protein or in the presence of bovine serum albumine. Note that the data indicate a 
25 slight interaction of both termini also in the absence of the cellular proteins. (C) Model 
of a protein-mediated cross talk of the 5' and the 3' end of the viral RNA. 5'-3' 
interaction might be a way to coordinate the translation (5'-3') and the replication (3'- 
5') cycle. 

30 Figure 8 graphically illustrates that different viral IRES elements recruit the same set 
of cellular proteins. (A) UV crosslinking/label transfer experiments with transcripts 
comprising the BVDV 5'UTR, BVDV 3'UTR, HAV 5'UTR, EMCV 5'UTR and 
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Rhinovirus 5'UTR. BKS RNA was used as a control. Proteins, which were confirmed 
to associate specifically to the viral RNAs (see Fig. 5), are indicated as in the previous 
figures. Asterisks mark proteins that bind non-specifically. Polypyrimidine-tract 
binding protein (PTB) and unr are indicated, unr was not confirmed. PTB was 
5 previously shown by the same assay to bind to the HAV, EMCV and rhinovirus 
5'UTR; unr was previously shown to bind to the rhinovirus 5'UTR (1). (B) UV 
crosslinking/label transfer experiments with transcripts comprising the BVDV 5'UTR, 
BVDV 3'UTR, HCV 5'UTR and the HCV 5'UTR+GUAU. The experiment was 
performed as described in the previous figures and in the text. Additional details 
10 concerning HCV 5'UTR +GUAU see Fig. 12. 

Figure 9 graphically illustrates data indicating an association of the cellular proteins 
with the HAV core-IRES domain. The different graphs show the RNA secondary 
structure of the different core-IRES domains of typel, type II, type HI and type IV 

15 IRESes as proposed by Le et al. (59). The translational start-codon as well as 
nucleotides that are 100% conserved between all different viruses are indicated; N 
stands for a variant number of nucleotides. In the case of the HCV IRES, the core- 
IRES model exhibits striking similarities with the RNA structure determined by RNase 
digestion and chemical modification procedures (see 15 and 16 and references herein; 

20 and Fig. 4). In comparison with the BVDV and HCV IRES, the HAV IRES element is 
bigger in size (ca. 350 nt versus 723 nt), and it has a less compact shape (see reference 
16). Thus, as a reasonable approach to generate RNA transcripts encompassing the 
correctly folded core-IRES domain, RNA transcripts corresponding to the HAV 5'UTR 
were digested with RNaseH in the presence of a suitable oligonucleotide, the site where 

25 RNaseH cuts is indicated in the figure. The resulting core-IRES RNA was purified and 
subjected to a UV -crosslinking/label transfer approach. The pattern of labelled proteins 
was compared side-by-side with that obtained with full-length HAV 5'UTR and BVDV 
3'UTR, respectively. As shown on the right portion of the figure, the pattern of 
labelled proteins turned out to be nearly identical in all three experiments. 

30 

Figure 10 graphically illustrates purification and identification of the viral RNA 
binding cellular factors. (A) Purification. Top: scheme summarizing the different 
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fractionation steps, starting material S10 extracts of Hela cells. Bottom: fractions of 
. proteins eluted by a salt gradient from the MonoQ sepharose column were tested via 
UV crosslinking/label transfer assay with 3'BVDV RNA to monitor the elution of the 
different vRbps. The SDS PAGE shows analysed fractions eluted between 300 and 450 
5 mM KC1, indications as in the previous figures. Lane 1 - pattern of labelled proteins 
obtained by UV crosslinking of total cytoplasmic extract of Hela cells; lane 2 - pattern 
of labelled proteins obtained by UV crosslinking of the heparine flow-thru fraction. 
Due to the fact that the entire set of proteins elutes in the fractions analysed on lanes 8- 

10, these fractions were pooled and the proteins separated on a preparative SDS PAGE. 
10 Coomassie-stained protein bands migrating at 84 kDa were cut out, digested with 

trypsine and the peptides extracted from the gel. MALDI-TOF analysis and 
microsequencing was performed on different tryptic peptides. (B) Identification of the 
viral RNA binding proteins part I. Top: side-by-side comparison of the pattern of 
vRbps labelled by UV cross-link/label transfer with radioactive 3'BVDV RNA and the 

15 pattern of proteins stained by a mixture of aNF90 and aNF45 antibodies (61) on 
western blots of total cytoplasmic extracts of Hela cells. The proteins, which are stained 
by the individual aNF90 and aNF45 antisera are indicated (data of separate blots not 
shown). The identity of the different proteins was concluded by the results obtained 
during microsequencing and data that were published on NFAT/NF90/NFAR- 1 , 

20 NFAR2 and NF45 by other laboratories (see text). Bottom: schematic representation of 
the structure and the motifs harboured by the aminoterminal 686 AA of all members of 
the NFAT/NF90/NFAR family. Abbreviations: NLS=nuclear localization sequence, 
dsRBM=double-strand RNA binding motif, RG=arginine glycine-rich RNA binding 
domain. 

25 

Figure 11 graphically illustrates identification of the viral RNA binding proteins part 

11. (A) "Supershift" of RNA-protein complexes by aNF90 and aNF45 antibodies. 
RNA mobility shift assay (RMSA) with [ 32 P] labelled RNA transcripts. Different 
amounts of cytoplasmic extracts, increasing amounts from right to left, were incubated 

30 with a specific [ 32 P] labelled RNA probe, e.g., HCV 5'UTR, BVDV 3'UTR, and 
comparable amounts of a non-specific antiserum and of aNF90 and aNF45 antisera, 
respectively. The RNP and RNP/antibody complexes (indicated on the right) were 
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separated on a 5% acrylamide/Tris borate gel. (B) RNA-protein coprecipitation (pull- 
down) assay with in vitro translated NF90 protein. In vitro translated [ 35 S] labelled 
NF90 protein was incubated with a specific, e.g., HCV 5'UTR, and a non-specific, e.g., 
BKS RNA transcript, respectively. The unlabelled RNA transcripts contained a poly-A 
5 tail and were subsequently precipitated by oligo dT sepharose. In vitro translated [ 35 S] 
labelled luciferase protein was used as a control. 

Figure 12 graphically illustrates implications for HCV. (A) Schematic representation 
of functional HCV/B VDV and BVDV/HCV chimeric RNAs. Top/left: RNA secondary 

10 structure of hairpin la of the BVDV 5'UTR (see also Fig. 4); the four GUAU 
nucleotides, which were found to be essential for BVDV RNA replication are depicted 
in red. Top/middle: RNA secondary structure of the HCV hairpin la +5'GUAU. 
BVDV RNA, where the BVDV 5'UTR was substituted by the HCV 5'UTR + GUAU 
was found to be replication competent (without GUAU, the BVDV RNA was 

15 replication deficient, 51). Top/right: UV crosslinking/label transfer analysis of RNA 
transcripts encompassing the HCVS'UTR, HCV5'UTR4GUAU and the BVDV 
5'UTR. The viral RNA binding proteins are indicated as in the previous figures. 
Bottom/left: schematic drawing of the organization of the hybrid HCV 3'V region 
containing the BVDV UGA box elements instead of SLstop, for additional details, see 

20 Fig. 12B. Bottom/right: UV crosslinking/label transfer analysis of RNA transcripts 
comprising the HCV 3'UTR and the HCV 3'UTRASLstop+BVDV 5'UGA boxes, 
respectively. (B) Structure and organization of HCV 3'V mutant RNAs. Top: RNA 
secondary structure of a BVDV and HCV 3'UTR (see also Fig. 3). Bottom: RNA 
secondary structure, calculated by mfold 3. 1 , of the HCV 3'V ASL st0 p mutant and of the 

25 HCV/BVDV 3'V chimera (see Fig. 12A). In Fig. 12B, the BVDV-derived sequence is 
depicted in light gray (HCV/BVDV chimera 5' loop); six additional nucleotides 
corresponding to an Afll restriction site in the original cDN A construct are depicted as 
CUUAAG in the HCV/BVDV chimera Fig. 12B. 

Figure 13 graphically illustrates an RNAi approach with aRHA oligonucleotides 
30 inhibits HCV replication. 

Description of the Invention 
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. The invention relates to a set of polypeptides, their production and uses, 
as well as variants, agonists and antagonists and their uses. In particular, in these and 
in other regards, the invention relates to a set of cellular polypeptides, hereinafter 
referred to as viral RNA binding proteins (vRbp). Preferably, vRbps include, but are 
5 not limited to vRbpl30, vRbpl20, vRbpllO, vRbp84, vRbp67, vRbp64, and vRbp45. 
Evidence is presented implicating a critical involvement of these proteins in the life 
cycle of positive-strand RNA viruses containing type I, type n, type in and type IV 
IRES (internal ribosomal entry site) elements: i.e., Enterovirus, Rhinovirus, 
Cardiovirus, Aphtovirus, hepatitis A virus, hepatitis C virus and pestivirus. 

10 Accordingly, [vRbpl30, vRbpl20, vRbpllO, vRbp84, vRbp67, vRbp64 and vRbp45] 
their potential protein interaction partners as well as their interaction-site(s) on the 
respective viral RNAs should be considered as targets for treatment of disease 
syndromes associated with infections of any of these viruses. The present invention 
relates to or unquestionably demonstrates that different members of the 

15 NFAT/NFAR/NF90 polypeptide family represent vRbpllO, vRbp84, and vRbp64, 
respectively, and that the NF90 associated polypeptide NF45 represents vRbp45. 
vRbpl20 is indicated to represent RNA helicase A (RHA). Other data implicate the 
proteins to regulate the coordination of translation and replication of the diverse viral 
genomes. Because all NFAT/NFAR/NF90 variants as well as RHA interact and/or are 

20 substrates of the dsRNA-activated protein kinase PKR, [vRbpl30, vRbpl20, vRbpllO, 
vRbp84, vRbp67, vRbp64 and vRbp45] are suggested to antagonize the cellular 
defence mechanisms against viral infections. 

The starting point of this invention was the discovery that subgenomic B VDV RNAs 
that lack the coding regions of the virus structural proteins are replication competent in 

25 transfected host-cells (36). BVDV "replicon RNA" can be generated by in vitro 
transcription from cloned cDNA constructs; it replicates in a wide range of different 
host-cells (e.g. MDBK, BHK-21, human hepatocytes or HeLa cells). In the meantime, a 
broad spectrum of monocistronic as well as bicistronic BVDV replicons has been 
composed (36, 37; see also Fig, 2). Essentially, they harbor the 5'UTR and the 3'UTR 

30 of the viral genome as well as a truncated part of the viral ORF, which comprises 
mainly five genetic units: i.e., NS3, NS4A, NS4B, NS5A and NS5B. The N-terminus of 
NS3 contains a serine protease domain, which, together with the NS4A cofactor, 
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catalyses the proteolytic cleavages of the non-structural NS3-NS5B polyprotein. The C- 
terminus of NS3 associates an ATPase and RNA helicase activity. NS5B represents the 
viral RdRp. The ftmction(s) of NS4B and NS5A are not known (reviewed in 2). The 
genomic organization of the region encoding NS3 to NS5B is virtually colinear in 
5 pestiviruses and hepaciviruses. Accordingly, the finding that subgenomic RNAs 
encompassing the UTRs and the NS3 to NS5B coding region encode all factors and 
elements, which, on the part of the virus, suffice for genome amplification has recently 
been extended to hepatitis C virus (38; Fig. 2). In comparison with BVDV, HCV RNA 
replicates less efficient (ca. 10.000 versus 1000 copies of viral RNA per cell), and its 

10 replication is restricted to only one host cell-type (i.e., Huh-7 cells). 

Due a number of experimental advantages with respect to full-length viral RNA, 
the most important of which concerns the possibility to examine RNA replication 
independently of events linked to RNA packaging and/or virion assembly, BVDV and 
HCV replicons are currently utilized to define individual components of the replication 

15 complex and to characterize their mode of activity. As a general experimental scheme, 
the viral RNA is mutagenized via the cDNA construct (a procedure termed as "reverse 
genetics") and the effects of mutagenesis on replication are monitored using 
appropriate assay systems such as RNase protection or RT-PCR. Reverse genetics 
studies are completed by biochemical experiments such as RNA structure probing, UV- 

20 crosslinking/label transfer experiments (see below) and specific assay systems to 
measure the enzymatic activities of the NS3 protease, the NS3 ATPase/RNA helicase 
and the NS5B RdRp, respectively (39-42). 

Besides serving as an experimental system to identify and to characterize the 
molecular determinants that control the viral RNA replication pathway, BVDV and 

25 HCV replicons proved to be useful tools to study the IRES-mediated translation 
process. For this purpose, in vivo translation assays were established, the most 
meaningful of which apply bicistronic constructs encoding a heterologous enzymatic 
activity such as O-glucoronidase (37; see also Fig. 2). In addition, the Applicants 
developed an in vitro translation assay based on cytoplasmic initiation factor fractions 

30 of authentic host cells (BHK-21 cells for BVDV; Huh-7 cells for HCV). Programmed 
with genuine viral RNA, the in vitro system was shown to appropriately mimic the in 
vivo situation of viral polyprotein synthesis and processing (39, 41, 43-45). Studies of 
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other laboratories and Applicants of the present invention revealed the following 
findings related to BVDV and HCV replicon systems, which are relevant for this 
invention: 

■ Taking a genetic approach to the BVDV replicon, the Applicants could show that 
5 all the mature replicon-encoded non-structural proteins NS3 to NS5B and all known 

virus-encoded enzymatic activities (protease, helicase, polymerase) are essentially 
involved in an early stage of the replication cycle. The majority of the non- 
structural proteins, namely NS3, NS4A, NS4B and NS5B were indicated to act in 
cis (e.g. in statu nascendi) during assembly of the replication complex; only one 
10 protein, NS5A, was suggested to operate in trans. In summary, these data 
demonstrate a close functional linkage of translation and processing of the viral 
proteins and their activity during replication. NS5A appears to play a particular role 
during viral RNA replication (39, 41). 

■ The S'-terminal portion of the viral ORF, which encodes the N-terminus of the 
15 autoprotease N pro (pestiviruses) or the N-tenninus of the capsid protein C (HCV), 

respectively, represents a functional entity of the IRES: i.e. this region is important 
for efficient translation, while it is only slightly involved in RNA replication. 
However, expression of an intact N 1 ™ or C protein is not essential for RNA 
replication (36, 37, 38, 45, 46). In conclusion, on the part of the virus, only the 
20 proteins that derive from the NS3 to NS5B coding region (i.e. the fully processed 
NS3 to NS5B proteins and hypothetical cleavage intermediates of the NS3-NS5B 
polyprotein) are involved in the assembly of the pestiviral and HCV replication 
complex. 

■ Structure probing and genetic approaches revealed that the highly conserved 3'- 
25 terminal portions of the BVDV and HCV 3'UTRs (termed as 3'C regions) form 

extensive stem-loop (SL) structures, which are essential for viral replication (42, 
47-50; see Fig. 3). With BVDV, structure as well as sequence motifs of the 3'C 
region were shown to be part of the "negative-strand promoter" of the initial 
replication complex; i.e., mutations, which modified these motifs were found to 
30 block the first step of the replication cycle (42). Importantly, the BVDV 3'C region 
was determined to be not essential for IRES-mediated translation initiation (45). 
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■ Similar strategies identified replication signals in the 5'UTR of the BVDV and the 
HCV genome, respectively. With BVDV, these motifs could be exactly defined; 
they concern sequence elements, which are exclusively located at or near the 
immediate S'-terminus of the viral RNA (43, 45, 51; see Fig. 4). With HCV, yet 

5 undefined replication signals are harbored by the S'-terminus of the viral genome; 
other elements appear to be localized in the IRES domain (52 and our data 44), As a 
common concept, the BVDV and HCV 5'UTRs contain "bi-functional" RNA 
elements, which modulate the translation as well as the replication process. Along 
this line, the overall integrity of the BVDV hairpin la structure and of the HCV 

10 domain III were found to be important for efficient translation initiation. In 
addition, these motifs contain sequence elements that are essential for the 
replication cycle. Reminiscent of the situation with the ORF or the 3'UTR (see 
above), mutations, which affected the replication signals in the BVDV la structure 
were observed to inhibit already the first replication step (43, 44). This important 

15 finding suggests that not only the 3'-end but also the 5'-end of the viral genome is 
involved in an early step of the replication pathway (see below). 

■ In contrast with the 3'C region of the BVDV and HCV 3'UTR, the region 
immediately downstream of the ORF exhibits a remarkable heterogeneity in terms 
of size and sequence composition. Accordingly, it was designated as the variable 

20 3'V region of the 3'UTR (47). Sequence alignments, computer modeling of the 
RNA secondary structure and experimental structure probing revealed that the 
pestiviral as well as the HCV 3'V region harbor several conspicuous RNA features. 
On the one hand, these concern so-called pseudo-stop elements, i.e. stop-codon like 
nucleotide triplets that are organized "in frame" with the ORF (Fig. 3). On the other 

25 hand, the portion of the 3'UTR immediately downstream of the translational stop 
forms an extensive stem-loop structure (termed as SLstop)- However, the pestivirus 
SUtqp structures exhibit generally a low stability, while the analogous structure of 
the HCV genome appears to be rather stable (53; see Fig. 3). Another difference 
concerns moderately conserved A/U-rich sequence elements (termed as "UGA- 

30 boxes"; consensus sequence: S'A/U-A-U/G/A-U-G/A-U-G/A-U/GA-AAJ-G/U-A- 
U/G/A3'; bold-typed residues are 100% conserved among all pestivirus species), 
which are located in single or multiple copies downstream of the translational stop- 
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codon of the pestivirus ORF. These elements are not present in the HCV 3'UTR. 
Instead, the HCV 3'V region contains a long polyU stretch and a polypyrimidine- 
rich region (Fig. 3). Interestingly, deletion of SL^op and/or the mutagenesis of 
conserved nucleotides within the BVDV UGA boxes caused a lower rate of 
5 translation initiation and inhibited the replication of the altered viral RNA (53; see 
also below). Similarly, deletion of the HCV SLstop structure was found to inhibit 
RNA replication. Most interestingly is the finding that despite of the 
aforementioned differences, the BVDV and HCV SL^op structures were shown to 
be functionally interchangeable (see below). 

10 

Taken together, three types of functional elements of the viral RNAs can be 
discriminated, (i) RNA structure motifs and sequence elements at the immediate 3' -end 
of the viral genome, which operate exclusively as replication signals, (ii) The IRES 
domain, which spans a major portion of the 5'UTR as well as the 5' -terminus of the 

15 protein-coding region. Although cap-independent entry of ribosomes is generally 
enabled by this region (55, 56), other parts of the RNA molecule such as the BVDV la 
domain and the 3' V region of the 3'UTR of the BVDV and HCV genome (54 and our 
data 53) were shown to have a considerable impact on the efficiency of translation 
initiation, (iii) Motifs at each end of the RNA molecule (with BVDV: haiipin la at the 

20 5'-end and the UGA box motifs in the 3' V region), which modulate translation as well 
as. replication of the viral RNA. The bi-functional character of these elements suggests 
that they play a key role during regulation of translation and RNA replication. 

Glossary 

25 The following definitions are provided to facilitate understanding of certain 

terms used frequently hereinbefore. 

The following definitions are provided to facilitate understanding of certain terms used 
frequently hereinbefore. 

"Isolated" means altered "by the hand of man" from its natural state, i.e., if it 
30 occurs in nature, it has been changed or removed from its original environment, or 
both. For example, a polynucleotide or a polypeptide naturally present in a living 
organism is not "isolated," but the same polynucleotide or polypeptide separated from 
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the coexisting materials of its natural state is "isolated", as the term is employed herein. 
Moreover, a polynucleotide or polypeptide that is introduced into an organism by 
transformation, genetic manipulation or by any other recombinant method is "isolated" 
even if it is still present in said organism, which organism may be living or non-living. 
5 "Antibodies" as used herein includes polyclonal and monoclonal antibodies, 

chimeric, single chain, and humanized antibodies, as well as vRbp fragments. 

"Polynucleotide" generally refers to any polyribonucleotide (RNA) or 
polydeoxribonucleotide (DNA), which may be unmodified or modified RNA or DNA. 
"Polynucleotides" include, without limitation, single- and double-stranded DNA, DNA 

10 that is a mixture of single- and double-stranded regions, single- and double-stranded 
RNA, and RNA that is mixture of single- and double-stranded regions, hybrid 
molecules comprising DNA and RNA that may be single-stranded or, more typically, 
double-stranded or a mixture of single- and double-stranded regions. In addition, 
"polynucleotide" refers to triple-stranded regions comprising RNA or DNA or both 

15 RNA and DNA. The term "polynucleotide" also includes DNAs or RNAs containing 
one or more modified bases and DNAs or RNAs with backbones modified for stability 
or for other reasons. "Modified" bases include, for example, tritylated bases and 
unusual bases such as inosine. A variety of modifications may be made to DNA and 
RNA; thus, "polynucleotide" embraces chemically, enzymatically or metabolically 

20 modified forms of polynucleotides as typically found in nature, as well as the chemical 
forms of DNA and RNA characteristic of viruses and cells. "Polynucleotide" also 
embraces relatively short polynucleotides, often referred to as oligonucleotides. 

"Polypeptide" refers to any polypeptide comprising two or more amino acids 
joined to each other by peptide bonds or modified peptide bonds, i.e., peptide isosteres. 

25 "Polypeptide" refers to both short chains, commonly referred to as peptides, 
oligopeptides or oligomers, and to longer chains, generally referred to as proteins. 
Polypeptides may contain amino acids other than the 20 gene-encoded amino acids. 
"Polypeptides" include amino acid sequences modified either by natural processes, 
such as post-translational processing, or by chemical modification techniques that are 

30 well known in the art. Such modifications are well described in basic texts and in more 
detailed monographs, as well as in a voluminous research literature. Modifications may 
occur anywhere in a polypeptide, including the peptide backbone, the amino acid side- 

25 



WO 2004/029199 



PCT/US2003/028654 



chains and the amino or carboxyl tennini. It will be appreciated that the same type of 
modification may be present to the same or varying degrees at several sites in a given 
polypeptide. Also, a given polypeptide may contain many types of modifications. 
Polypeptides may be branched as a result of ubiquitination, and they may be cyclic, 
5 with or without branching. Cyclic, branched and branched cyclic polypeptides may 
result from post-translation natural processes or may be made by synthetic methods. 
Modifications include acetylation, acylation, ADP-ribosylation, amidation, 
biotinylation, covalent attachment of flavin, covalent attachment of a heme moiety, 
covalent attachment of a nucleotide or nucleotide derivative, covalent attachment of a 

10 lipid or lipid derivative, covalent attachment of phosphotidylinositol, cross-linking, 
cyclization, disulfide bond formation, demethylation, formation of covalent cross-links, 
formation of cystine, formation of pyroglutamate, formylation, gamma-carboxylation, 
glycosylation, GPI anchor formation, hydroxylation, iodination, methylation, 
myristoylation, oxidation, proteolytic processing, phosphorylation, prenylation, 

15 racemization, selenoylation, sulfation, transfer-RNA mediated addition of amino acids 
to proteins such as arginylation, and ubiquitination (see, for instance, Proteins - 
Structure and Molecular Properties, 2nd Ed., T. E. Creighton, W. H. Freeman and 
Company, New York, 1993; Wold, F., Post-translational Protein Modifications: 
Perspectives and Prospects, 1-12, in Post-translational Covalent Modification of 

20 Proteins* B. C. Johnson, Ed., Academic Press, New York, 1983; Seifter et al, 
"Analysis for protein modifications and nonprotein cofactors", Meth Enzymol, 182, 
626-646, 1990, and Rattan et al., "Protein Synthesis: Post-translational Modifications 
and Aging", Ann NY Acad Sci, 663, 48-62, 1992). 

"Fragment" of a polypeptide sequence refers to a polypeptide sequence that is 

25 shorter than the reference sequence but that retains essentially the same biological 
function or activity as the reference polypeptide. "Fragment" of a polynucleotide 
sequence refers to a polynucleotide sequence that is shorter than the reference sequence 
of a vRbp. 

"Variant" refers to a polynucleotide or polypeptide that differs from a reference 
30 polynucleotide or polypeptide, but retains the essential properties thereof. A typical 
variant of a polynucleotide differs in nucleotide sequence from the reference 
polynucleotide. Changes in the nucleotide sequence of the variant may or may not alter 
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the amino acid sequence of a polypeptide encoded by the reference polynucleotide. 
Nucleotide changes may result in amino acid substitutions, additions, deletions, fusions 
and truncations in the polypeptide encoded by the reference sequence, as discussed 
below. A typical variant of a polypeptide differs in amino acid sequence from the 
5 reference polypeptide. Generally, alterations are limited so that the sequences of the 
reference polypeptide and the variant are closely similar overall and, in many regions, 
identical. A variant and reference polypeptide may differ in amino acid sequence by 
one or more substitutions, insertions, deletions in any combination. A substituted or 
inserted amino acid residue may or may not be one encoded by the genetic code. 

10 Typical conservative substitutions include Gly, Ala; Val, lie, Leu; Asp, Glu; Asn, Gin; 
Ser, Thr; Lys, Arg; and Phe and Tyr. A variant of a polynucleotide or polypeptide may 
be naturally occurring such as an allele, or it may be a variant that is not known to 
occur naturally. Non-naturally occurring variants of polynucleotides and polypeptides 
may be made by mutagenesis techniques or by direct synthesis. Also included as 

15 variants are polypeptides having one or more post-translational modifications, for 
instance glycosylation, phosphorylation, methylation, ADP ribosylation and the like. 
Embodiments include methylation of the N-terminal amino acid, phosphorylations of 
serines and threonines and modification of C-terminal glycines. 

"Allele" refers to one of two or more alternative forms of a gene occurring at a 

20 given locus in the genome. 

"Polymorphism" refers to a variation in nucleotide sequence (and encoded 
polypeptide sequence, if relevant) at a given position, in the genome within a 
population. 

"Single Nucleotide Polymorphism" (SNP) refers to the occurrence of nucleotide 
25 variability at a single nucleotide position in the genome, within a population. An SNP 
may occur within a gene or within intergenic regions of the genome. SNPs can be 
assayed using Allele Specific Amplification (ASA). For the process at least 3 primers 
are required. A common primer is used in reverse complement to the polymorphism 
being assayed. This common primer can be between 50 and 1500 bps from the 
30 polymorphic base. The other two (or more) primers are identical to each other except 
that the final 3 'base wobbles to match one of the two (or more) alleles that make up the 
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polymoiphism. Two (or more) PCR reactions are then conducted on sample DNA, 
each using the common primer and one of the Allele Specific Primers. 

"Splice Variant" as used herein refers to cDNA molecules produced from RNA 
molecules initially transcribed from the same genomic DNA sequence but which have 
5 undergone alternative RNA splicing. Alternative RNA splicing occurs when a primary 
RNA transcript undergoes splicing, generally for the removal of introns, which results 
in the production of more than one mRNA molecule each of that may encode different 
amino acid sequences. The term splice variant also refers to the proteins encoded by 
the above cDNA molecules. 

10 "Identity" reflects a relationship between two or more polypeptide sequences or 

two or more polynucleotide sequences, determined by comparing the sequences. In 
general, identity refers to an exact nucleotide to nucleotide or amino acid to amino acid 
correspondence of the two polynucleotide or two polypeptide sequences, respectively, 
over the length of the sequences being compared. 

15 "% Identity" - For sequences where there is not an exact correspondence, a "% 

identity" may be determined. In general, the two sequences to be compared are aligned 
to give a maximum correlation between the sequences. This may include inserting 
"gaps" in either one or both sequences, to enhance the degree of alignment. A % 
identity may be determined over the whole length of each of the sequences being 

20 compared (so-called global alignment), that is particularly suitable for sequences of the 
same or very similar length, or over shorter, defined lengths (so-called local alignment), 
that is more suitable for sequences of unequal length. 

"Similarity" is a further, more sophisticated measure of the relationship between 
two polypeptide sequences. In general, "similarity" means a comparison between the 

25 amino acids of two polypeptide chains, on a residue by residue basis, taking into 
account not only exact correspondences between a between pairs of residues, one from 
each of the sequences being compared (as for identity) but also, where there is not an 
exact correspondence, whether, on an evolutionary basis, one residue is a likely 
substitute for the other. This likelihood has an associated "score" from which the "% 

30 similarity" of the two sequences can then be determined. 

Methods for comparing the identity and similarity of two or more sequences are 
well known in the art. Thus for instance, programs available in the Wisconsin 
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Sequence Analysis Package, version 9.1 (Devereux J et al, Nucleic Acids Res, 12, 387- 
395, 1984, available from Genetics Computer Group, Madison, Wisconsin, USA), for 
example the programs BESTFTT and GAP, may be used to determine the % identity 
between two polynucleotides and the % identity and the % similarity between two 
5 polypeptide sequences. BESTFTT uses the "local homology" algorithm of Smith and 
Waterman (J Mol Biol, 147,195-197, 1981, Advances in Applied Mathematics, 2, 482- 
489, 1981) and finds the best single region of similarity between two sequences. 
BESTFTT is more suited to comparing two polynucleotide or two polypeptide 
sequences that are dissimilar in length, the program assuming that the shorter sequence 

10 represents a portion of the longer. In comparison, GAP aligns two sequences, finding a 
"maximum similarity", according to the algorithm of Needleman and Wunsch (J Mol 
Biol, 48, 443-453, 1970). GAP is more suited to comparing sequences that are 
approximately the same length and an alignment is expected over the entire length. 
Preferably, the parameters "Gap Weight" and "Length Weight" used in each program 

15 are 50 and 3, for polynucleotide sequences and 12 and 4 for polypeptide sequences, 
respectively. Preferably, % identities and similarities are determined when the two 
sequences being compared are optimally aligned. 

Other programs for determining identity and/or similarity between sequences 
are also known in the art, for instance the BLAST family of programs (Altschul S F et 

20 al, J Mol Biol, 215, 403-410, 1990, Altschul S F et al, Nucleic Acids Res., 25:389- 
3402, 1997, available from the National Center for Biotechnology Information (NCBI), 
Bethesda, Maryland, USA and accessible through the home page of the NCBI at 
www.ncbi.nlm.nih.gov) and FASTA (Pearson W R, Methods in Enzymology, 183, 63- 
99, 1990; Pearson W R and Lipman D J, Proc Nat Acad Sci USA, 85, 2444-2448,1988, 

25 available as part of the Wisconsin Sequence Analysis Package). 

Preferably, the BLOSUM62 amino acid substitution matrix (Henikoff S and 
Henikoff J G, Proc. Nat. Acad Sci. USA, 89, 10915-10919, 1992) is used in 
polypeptide sequence comparisons including where nucleotide sequences are first 
translated into amino acid sequences before comparison. 

30 Preferably, the program BESTFTT is used to determine the % identity of a query 

polynucleotide or a polypeptide sequence with respect to a reference polynucleotide or 
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a polypeptide sequence, the query and the reference sequence being optimally aligned 
and the parameters of the program set at the default value, as hereinbefore described. 

"Identity Index" is a measure of sequence relatedness which may be used to 
compare a candidate sequence (polynucleotide or polypeptide) and a reference 
5 sequence. Thus, for instance, a candidate polynucleotide sequence having, for 
example, an Identity Index of 0.95 compared to a reference polynucleotide sequence is 
identical to the reference sequence except that the candidate polynucleotide sequence 
may include on average up to five differences per each 100 nucleotides of the reference 
sequence. Such differences are selected from the group consisting of at least one 

10 nucleotide deletion, substitution, including transition and trans version, or insertion. 
These differences may occur at the 5' or 3' terminal positions of the reference 
polynucleotide sequence or anywhere between these terminal positions, interspersed 
either individually among the nucleotides in the reference sequence or in one or more 
contiguous groups within the reference sequence. In other words, to obtain a 

15 polynucleotide sequence having an Identity Index of 0.95 compared to a reference 
polynucleotide sequence, an average of up to 5 in every 100 of the nucleotides of the in 
the reference sequence may be deleted, substituted or inserted, or any combination 
thereof, as hereinbefore described. The same applies mutatis mutandis for other values 
of the Identity Index, for instance 0.96, 0.97, 0.98 and 0.99. 

20 Similarly, for a polypeptide, a candidate polypeptide sequence having, for 

example, an Identity Index of 0.95 compared to a reference polypeptide sequence is 
identical to the reference sequence except that the polypeptide sequence may include an 
average of up to five differences per each 100 amino acids of the reference sequence. 
Such differences are selected from the group consisting of at least one amino acid 

25 deletion, substitution, including conservative and non-conservative substitution, or 
insertion. These differences may occur at the amino- or carboxy-terminal positions of 
the reference polypeptide sequence or anywhere between these terminal positions, 
interspersed either individually among the amino acids in the reference sequence or in 
one or more contiguous groups within the reference sequence. In other words, to obtain 

30 a polypeptide sequence having an Identity Index of 0.95 compared to a reference 
polypeptide sequence, an average of up to 5 in every 100 of the amino acids in the 
reference sequence may be deleted, substituted or inserted, or any combination thereof, 
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as hereinbefore described. The same applies mutatis mutandis for other values of the 
Identity Index, for instance 0,96, 0.97, 0.98 and 0.99. 

The relationship between the number of nucleotide or amino acid differences 
and the Identity Index may be expressed in the following equation: 
5 n a £x a -(x a »I), 

in which: 

n a is the number of nucleotide or amino acid differences, 

x a is the total number of nucleotides, or amino acids in ROCK or ROCK, respectively, 
I is the Identity Index, 
10 • is the symbol for the multiplication operator, and 

in which any non-integer product of x a and I is rounded down to the nearest integer 
prior to subtracting it from x a . 

"Homolog" is a generic term used in the art to indicate a polynucleotide or polypeptide 
sequence possessing a high degree of sequence relatedness to a reference sequence. 
15 Such relatedness may be quantified by determining the degree of identity and/or 
similarity between the two sequences as hereinbefore defined. Falling within this 
generic tenn are the terms "ortholog", and "paralog". "Ortholog" refers to a 
polynucleotide or polypeptide that is the functional equivalent of the polynucleotide or 
polypeptide in another species. "Paralog" refers to a polynucleotideor polypeptide that 
20 within the same species which is functionally similar. 

"Modulates" means in reference to an activity herein, resulting in a change in an 
amount, and/or quality, and/or effect of a particular response and/or activity. Both 
increases and/or decreases in a response and/or activity are included. 

"Picornaviridae" as used herein refers to a family of single-stranded RNA- 
25 containing viruses that cause hepatitis in humans. 

"Enterovirus" as used herein refers to a genus of Picornaviridae that 
preferentially replicate in the mammalian intestinal tract. It includes the polioviruses 
and Coxsackie viruses. 

"Rhinovirus" as used herein refers to a genus of Picornaviridae that largely 
30 infect the upper respiratory tract. Include the common cold virus and foot and mouth 
disease virus. 
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"Cardiovirus" as used herein refers to a genus of viruses belonging to the 
Family Picornaviridae, isolated mostly from rodents, cause encephalitis and 
myocarditis. 

"Hepatovirus" as used herein refers to a genus of Picornaviridae causing 
5 infectious hepatitis naturally in humans and experimentally in other primates. It is 
transmitted through faecal contamination of food or water. 

"Aphtovirus" as used herein refers to a genus of the family picornaviridae 
causing foot-and-mouth disease in cloven-hoofed animals. 

"Flaviviridae" as used herein refers to a family of single-stranded RNA- 
10 containing viruses that cause haemorrhagic fever in a wide range of mammals and are 
transmitted by mosquitos, such as West Nile Virus, and ticks. 

"Ravi virus" as used herein refers to a genusof Flaviviridae, also known as 
group b arbovirus, containing several subgroups and species. Most are arboviruses 
transmitted by mosquitoes or ticks. The type species is yellow fever virus. 
15 "Pestivirus" as used herein refers to a genus of Flaviviridae, also known as 

mucosal disease virus group, which is not arthropod-borne. Transmission is by direct 
and indirect contact, and by transplacental and congenital transmission. Species 
include border disease virus, bovine viral diarrhea virus (diarrhea virus, bovine viral), 
and hog cholera virus. 

20 "Hepacivirus" as used herein refers to a non-A, non-B RNA virus causing post- 

transfusion hepatitis; it appears to be a member of the family Flaviviridae. 

"Antagonist" as used herein refers to a substance that tends to nullify the action 
of another, as a drug that binds to a cell receptor without eliciting a biological response. 
"Agonist" as used herein refers to a substance that has affinity for and 
25 stimulates physiologic activity at cell receptors normally stimulated by naturally 
occurring substances, thus triggering a biochemical response. 

"Fusion protein" refers to a protein encoded by two, often unrelated, fused 
genes or fragments thereof. In one example, employing a fusion protein is 
advantageous for use in therapy and diagnosis resulting in, for exaimple, improved 
30 pharmacokinetic properties. On the other hand, for some uses it would be desirable to 
be able to delete part of a protein. 
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"vRbpl30" as used herein refers to a post-translational modification of RNA 
helicase A. 

"vRbpl20 M as used herein refers to a complex with NF90/NFAR1 and NF45 
(RNA helicase A or RHA). 
5 "vRbpllO" as used herein refers to an alternatively spliced form of NFARI 

(NFARIQ. 

"vRbp84" as used herein refers to a C-terminally modified NF90 (NFARI). 

"vRbp67" as used herein refers to a 64 kDa subunit of cleavage stimulatory 
factor (CSTF) involved in polyadenylation of mRNAs, which however, does not bind 
10 specifically to viral RNAs. 

"vRbp64" as used herein refers to an alternatively spliced foxm of NFARI and 
NFARH 

"vRbp45" as used herein refers to a complex with NF90/NFAR1 and RNA 
helicase A (NF45). 

15 "Cross-Talk" as used herein refers to extensive interactions between the viral 

termini (3* and 5' UTR), or interactions between the structural elements within the 3htr 
and the stop codon in NS5B are likely to be critical in regulating translation 
termination, translational frameshifting and the coordinated balance of replication and 
translation on the positive strand RNA. As HCV is an RNA virus, the viral RNA forms 

20 highly ordered secondary, and tertiary conformations. Many of these conformations 
have been determined by biophysical probing, such as that for the 5fctr. It is equally 
likely, that the ordered stem-loop structures of the RNA are critical to control 
translation and replication. Circularization of the viral genome may occur directly via 
the UTRs or facilitated by the UTR along with said cellular proteins bound to the UTR. 

25 Additionally, multiple contacts of the UTR RNA, or UTR RNA with said cellular 
proteins bound, may interact with other regions of the viral genome. 

In addition, the present invention relates to methods of interfering with the 
translational regulation and replication of HCV RNA could occur by providing excess 
amounts of 3 UTR RNA, or 3 UTR RNA elements which are required for interacting 

30 with the said cellular proteins. In effect, providing an exogenous source of viral RNA 
capable of binding the said cellular proteins should effectively serve as a sink, to titrate 
out the 'activity' of these cellular proteins. If they were sufficiently removed from the 
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test system, viral replication should be substantially reduced. Since these proteins may 
be directly required for viral replication, and their availability to interact with the 
authentic viral genome becomes limited upon effective binding to the RNA decoy sink, 
viral replication should be decreased. Additionally, removal of these cellular proteins 
5 from binding the authentic viral genome, may result in the loss of coordinated 
regulation between translation and replication. An decrease in accurate termination of 
translation would be expected to be a direct outcome of the loss of these cellular 
proteins binding to the authentic UTR of the viral genomes. Upon increased translation 
beyond the authentic stop codon in NS5B, steric hindrance or competition between the 

10 ribosomes (for translation) and the initiation of viral RNA synthesis by the replicase 
complex binding to the 3TJTR, should result in a direct decrease in viral replication. 
Also, systems utilizing peptide-nucleic acid conjugates may represent a more attractive 
approach to creating a functional sink with nucleic acids, and with such sinks as having 
improved DMPK properties over conventional nucleic acids. 

15 "Reticulocyte lysate translation assay" as used herein refers to methods for 

modulating a fraction of said cellular proteins within translation extract (luciferase 
RNA), should result in modulation of luciferase activity and therefore translation. In 
addition, the assay can also (i) measure impact on PKR, (ii) look at UTRs or mutant 
UTRs (containing mutations within binding sites for said cellular proteins) to modulate 

20 translation, and (iii) monitor compound interference. 

"Cell-based translation frameshift assay" as used herein refers to methods for 
assays that identify compounds that would be predicted to enhance translational 
frameshifting, and/or decrease translation termination at authentic stop codon. 
Compounds capable of doing this would be expected to result in ribosomes moving 3' 

25 from the stop codon, and represent a steric hindrance for replicase protein binding. The 
assay can (i) monitor by ELISA for small peptide generated by this frameshift, (ii) 
could use a BRET assay to monitor the interaction of said cellular proteins from 5 UTR 
with said cellular proteins binding 3 UTR, and (iii) can be a measure of genome 
circularization. 

30 All publications and references, including but not limited to patents and patent 

applications, cited in this specification are herein incorporated by reference in their 
entirety as if each individual publication or reference were specifically and individually 
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indicated to be incorporated by reference herein as being fully set forth. Any patent 
application to which this application claims priority is also incorporated by reference 
herein in its entirety in the manner described above for publications and references. 

5 Examples 

The invention is further illustrated by way of the following examples which are 
intended to elucidate the invention. These examples are not intended, nor are they to be 
construed, as limiting the scope of the invention. It will be clear that the invention may 
be practiced otherwise that as particularly described herein. Numerous modifications 
10 and variations of the present invention are possible in view of the teachings herein and, 
therefore, are within the scope of the invention. The examples below are carried out 
using standard techniques, which are well known and routine to those of skill in the art, 
except where otherwise described in detail. 

15 Example 1: A set of ubiquitous cellular proteins binds to the 5' and 3'UTR of 
pestiviral RNA and is critically involved in translation and RNA replication. 

The new invention concerns a set of RNA-binding proteins (termed as vRbpl30, 
vRbpl20, vRbpllO, vRbp84, vRbp67, vRbp64 and vRbp45), which were originally 
identified by UV crosslinking/label transfer approaches to bind to the UGA-box 

20 elements of the BVDV 3'V region (53). Competition experiments demonstrated that 
binding of vRbpl30, vRbpl20, vRbpllO, vRbp84, vRbp64 and vRbp45 to the viral 
RNA is highly specific. vRbp67 was determined to bind in a non-specific manner (53; 
see Fig. 5). The vRbp "host-factors" are ubiquitous in all cell-types that support BVDV 
replication (Fig. 5), and they can be fractionated from a ribosomal salt wash (53). The 

25 latter result suggested that several of these proteins represent non-canonical 
components of the cellular translation apparatus (see below). While the specific binding 
factors vRbpl30, vRbpl20, vRbpllO, vRbp84, vRbp64 and vRbp45 were suggested or 
evidently shown to represent different members of dsRNA binding proteins (for details, 
see below), the non-specific RNA-binding protein vRbp67 was demonstrated to 

30 correspond to the 64 kDa subunit of cleavage stimulatory factor (CSTF) (reviewed in 
reference 57). 
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Importantly, binding of the vRbps to the UGA elements correlated strictly with 
the ability of BVDV replicon RNA to amplify within the host-cell. Moreover, the 
interaction of these cellular factors with the 3' V region was indicated to be essential for 
the clearance of translating ribosomes from the viral RNA. Thus, mutant BVDV RNAs 
5 containing deletion and/or point mutations, which changed the sequence of the UGA 
box and pseudo-stop elements (the latter, which are mostly part of the UGA box 
consensus sequence) and which modified the folding of SLs to p and SLII of the 3'V 
region, were found to associate the vRbps to a significantly lesser extent (Fig. 6). As 
mentioned above, the efficiency of translation initiation of these mutant RNAs was 

10 found to be reduced, and, most strikingly, proper termination of translation was 
observed to be impaired, i.e. a significant read-through of the translation^ stop-codon 
of the ORF by ribosomes could be detected (Fig. 6). Consistent with the idea that 
translation and replication are mutually exclusive events (see above), and that 
incomplete translation termination should interfere with the assembly of the functional 

15 replication complex, viral RNA derivatives encoding thus modified 3' V regions turned 
out to be replication deficient (Fig. 6). As explained further below, analogous results 
were obtained with HCV RNA (44, 53). 

A further series of crosslinking and competition experiments demonstrated that 
the identical range of factors (vRbp 130, vRbpl20, vRbpllO, vRbp84, vRbp64 and 

20 vRbp45) bind also specifically to the BVDV 5'UTR (Fig. 7). Importantly, binding of 
the vRbp proteins to the 3'V region could be competed with transcripts consisting of 
the 5'UTR and vice versa (53). The RNA-protein interaction site(s) within the BVDV 
5'UTR (note that the 5'UTR does not contain UGA box like sequence elements) has 
not yet been defined. However, initial indications came from experiments showing that 

25 hairpin la mutations, which inhibited translation and/or RNA replication, respectively 
0, reduced the capability of the BVDV 5'UTR to associate the vRbps (53). Since 
hairpin la per se does not bind the proteins (53), the RNA-protein interaction domain is 
suggested to represent a complex RNA motif, which involves also hairpin la (see also 
below). 

30 As an important result which suggests a host-factor mediated 5 '-3' cross-talk of 

the viral RNA, fractions containing vRbp84 and vRbp45 were found to precipitate 
radioactively labeled transcripts covering the 3'UTR via biotinylated transcripts which 
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encompass the 5'UTR (Fig. 7). These experiments are currently repeated with the 
purified NF90/NFAR-1 and NF45 proteins. 

In summary, the presented data provide evidence for the formation of a specific 
viral/ cellular RNP complex critically involved in translation and RNA replication or 
5 the coordinated regulation of translation and replication of B VDV RNA. (i) Association 
of the vRbps with the viral RNA involves the aforementioned "bi-functionaT RNA 
motifs: i.e., the hairpin la structure at the 5'-end and the UGA box elements at the 3'- 
end of the RNA. (ii) Inhibition of binding of the vRbps to the 5' or 3'-end of the viral 
RNA strictly correlates with inhibition of translation and/or replication of the viral 

10 RNA. (iii) The modification of UGA box elements in the 3'UTR cause a less efficient 
termination of translation. Accordingly, the replication deficiency of UGA box mutants 
may be explained by a disturbed coordination of translation versus replication, or, in 
other words, by an interference of the translation with the replication machinery, (iv). 
As strongly indicated by the coprecipitation experiments, simultaneous binding of the 

15 vRbps to the 5' as well as to the 3* -end may bring about a physical and functional link 
between both ends of the viral RNA and may thus enable feed-back regulation between 
the translation and replication machinery (Fig. 7). Along this line, it is possible that the 
RNP complex and associating viral protein(s) (e.g. NS5A) contribute to the 
displacement of ribosomes from the RNA. Alternatively, it is conceivable that the state 

20 of the assembling replication complex at the 3'-end of the viral RNA modulates 
translation initiation via 3'-5' cross talk (53). 

Example 2: The same set of cellular proteins associates with the UTRs of different 
types of picornaviruses and hepatitis C virus. 

25 Association of the entire set of Rbps (vRbpl30, vRbpl20, vRbpllO, vRbp84, 

vRbp64 and vRbp45) was also detected with the 5'UTR and 3'UTR of other 
pestiviruses such as CSFV (53). Moreover, the vRbps were determined to bind also to 
the UTRs of several other RNA viruses (Fig. 8). (i) Although the cross-linking signals 
were weak, binding of these factors to the 5'UTR of HCV was clearly detectable. No 

30 label-transfer occurred with different transcripts of the HCV 3'UTR (see below), (ii) 
Intriguingly, an identical label transfer pattern was observed during cross-linking 
experiments which applied the 5'UTR or the 3'UTR of HAV (hepatitis A virus) strain 
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HM175 (Fig. 8; the 3'data were already published by Kusov et al., 58). (iii) Binding of 
the vRbps was also found with the 5'UTR of Rhinovirus type 14 and (iv) the 5'UTR of 
EMCV (encephalomyocarditis virus) (Fig. 8). The 3'UTRs of the latter viruses have not 
yet been tested. Hence, association of vRbpl30, vRbpl20, vRbpllO, vRbp84, vRbp64, 
5 and vRbp45 was observed with the 5'UTR of viruses harboring a type I (Entero- 
/Rhinoviruses), type II (Cardio-/Aphtoviruses), type HI (Hepatitis A virus) or type IV 
(hepatitis C virus/pestiviruses) IRES element. As with the BVDV and CSFV 5' and 
3'UTR, the specificity of the RNA-protein interactions was confirmed by cross- 
competition experiments: for example, the association of the proteins to the HCV 

10 5'UTR could be chased by RNA transcripts comprising the HAV 5'UTR etc. In 
contrast, non-related RNAs, such as t-RNA or diverse mRNA transcripts did not 
compete the binding of the proteins to the viral RNAs (53). 

As shown in Fig. 8, the amounts of transferred label differed significantly 
between the various test RNAs. Considering that the supposed protein interaction 

15 site(s) (see below) of each of the different 5'UTRs comprise a variant number of 
labeled nucleotides, the data are difficult to interpret in terms of the efficiency of a 
certain RNA/protein interaction. Once the identified vRbps (see below) become 
available in purified form, more meaningful techniques can be applied to confirm the 
efficiency of the interaction of these factors to the different UTRs as well as to 

20 elements (such as the HCV 3'UTR), which yielded a negative result during 
crosslinking experiments. 

As already mentioned, neither the 5'UTRs of the diverse members of the 
Picornaviridae family nor the 5'UTRs of hepatitis C virus or pestiviruses contain UGA 
box-like sequence elements. However, despite limited sequence identity, the structural 

25 and functional organization is highly shared between IRESes of the same type (e.g. 
between HCV and pestiviruses, see also Fig. 4). Moreover, computer derived RNA 
folding and phylogenetic comparative analyses suggested a common "IRES core- 
domain" for the different picornaviruses as well as for the divergent hepatitis C virus 
and pestiviruses (59). Tests whether this structure motif, which involves approximately 

30 100 nucleotides near the translation initation codon, and which covers the 40S 
interaction domain (see above), may represent the common binding site of vRbpl30, 
vRbpl20, vRbpllO, vRbp84, vRbp64 and vRbp45 within the picornavirus, HCV and 
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pestivirus IRES are underway. Initial indications that the core-IRES may represent a 
part of the protein binding site came from an RNase H digestion approach, which 
allowed the purification of the correctly folded 3' 150 nucleotides of the 5'UTR of 
HAV. As shown by UV crosslinking, this region, which corresponds almost exactly to 
5 the proposed IRES core-domain, assembles indeed the entire set of vRbps (Fig. 9). 
Considering the core-IRES as a major vRbp binding site, the fact that with the BVDV 
system formation of the S'-terminal hairpin la motif was found to be important for 
efficient interaction of the vRbps with the 5'UTR may be interpreted in two ways, (i) 
Formation of haiipin la may have a cooperative effect on the folding of the IRES core- 
10 domain and/or (ii) elements of hairpin la are in contact with parts of the core IRES (see 
also below). Taken together, these data suggest that the set of vRbp proteins associate 
with a complex, common RNA motif harbored by the 5'UTRs of the different virus 
species. 

By chromatographic methods, the Applicants purified vRbp84 from cytoplasmic 

15 fractions of HeLa cells and determined its identity by mass-spectroscopy. The cellular 
factor identified herein, namely a member of the NF90 family (see below), was distinct 
from those reported by other laboratories as to interact with the genomes of 
picornaviruses, HCV and pestiviruses, respectively (see above). The fact that NF90 (or 
a close relative of this protein, see below) corresponds to the originally detected 

20 vRbp84 was verified by three types of experimental procedures: (i) By comparison of 
the gel retardation factor (RF) of the immunostained and the crosslinked/labeled protein 
on SDS-PAGE (Fig. 10). (ii) Via RNA coprecipitation experiments with the in vitro 
translated [ 35 S]-labeled NF90; i.e. coprecipitation of the protein could be exclusively 
detected with a specific RNA probe but not with an unrelated RNA (Fig. 10). (iii) By 

25 RMSA (RNA mobility shift assays) and "super-shifts" with DNF90 antibodies and 
different viral 5'UTRs (Fig. 10). 

NF90/NFAR-1 is a double-stranded RNA binding protein, which has been 
originally characterized as a NFAT (nuclear factor of activated T cells)-binding 
component of the antigen receptor response element (ARRE) from the interleukin 2 

30 promoter (60). Subsequently, it was designated as NF90 and NFAR-1, respectively (61, 
62). The protein, which is present in the nucleus as well as in the cytoplasm of the cell, 
harbours a bipartite nuclear localization domain (NLS) and two dsRBMs (double-strand 
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RNA binding motifs; reviewed in reference 63; see also Fig. 10). The coding gene is 
the so-called interleukin enhancer binding factor 3 gene (ILF3), which has been 
mapped to chromosome 19 in humans and to chromosome 9 in mice. The human gene 
spans 38 kb and is divided into 21 exons. Different reports indicate that a series of 
5 isofonns are expressed due to alternative splicing of the same mRNA. The protein 
isoforms diverge only at the carboxiterminal region of the proteins (64). Besides NF90 
and NFAR-1 which differ for other reasons (see below) by 109 AA at the C-terminus, 
two isofonns were so far characterized. These are the TCP ("translational control 
protein"; 65), which, with respect to NFAR-1, differs by 15 AA at the C-tenninus and 

10 contains 62 additional AA residues, and the NFAR-2, which differs by 15 AA at the C- 
tenninus and contains 192 additional AA residues (62). Interestingly, different 
members of this protein-family are identified by autoantibodies of patients and mice 
with systemic autoimmune diseases (66). NF90/NFAR-1 was shown to bind to and to 
be a substrate of PKR (67). In accord with this finding, NFAR-1 and NFAR-2, which 

15 are suggested to be involved in gene-expression processes (62, 68), share a striking 
homology with eIF2D (62). 

In the course of the cloning procedure, the Applicants found that the only 
difference between the cDNA of NF90 and NFAR-1 concerns a two base-pair frame- 
shift in the NF90 cDNA clone of Kao et aL, which consequently leads to the expression 

20 of a different C-terminus of NF90. In other words, NF90 and NFAR-1 are not 
alternatively spliced forms of the same mRNA but represent most probably the same 
protein. In conclusion, the so-called NF90 protein family consists of three known 
members: NF90/NFAR-1 (calculated molecular weight, ca. 78 kDa), TCP (differs by 
ca. 7 kDa with respect to NFAR-1 - accordingly, it has a calculated molecular weight 

25 of ca. 85 kDa) and NFAR-2 (calculated molecular weight, ca. 99 kDa). The N-terminal 
690 A A residues are identical in all three family members (Fig. 10). Accordingly 
western-blots performed with an DNF90 antiserum (61) on total cytoplasmic proteins 
of HeLa cells stained a set of protein bands migrating at molecular weights of about 84, 
90 and 110 kDa on SDS PAGE (Fig. 11), which were suggested to correspond to 

30 NF90/NFAR-1, TCP and NFAR-2, respectively. Moreover, the DNF90 antiserum 
clearly stained a protein with a molecular weight of 64 kDa, which was accordingly 
suggested to represent a yet unknown, additional isoform of the NF90 family (Fig. 11). 
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Strikingly, the overall pattern and the RF values of the proteins that are stained 
by the DNF90 antiserum are virtually congruent with the pattern and RF values of the 
vRbps labelled during UV cross-linking/label transfer experiments with the different 
viral RNA probes (see Fig. 1 1 and above). Li view of these data, it is reasonable to 
5 suggest that vRbpllO corresponds to NFAR-2 and that vRbp64 represents the 
aforementioned 64 kDa NF90 isoform. vRbp84, which with HeLa extracts generally 
separates as a double band on SDS-PAGE (see Fig. 5), should thus correspond to 
NF90/NFAR-1 and TCP, respectively. 

In the course of the purification procedure, vRbp84 was found to co-fractionate 

10 with vRbp45 (53). Considering vRbpllO, vRbp84 and vRbp64 as members of the 
NF90 family, it was a natural suspicion that vRbp45 represents the so-called NF45 
protein. NF45 was previously shown to form a stable complex with NF90 (61) and to 
modulate the function of NF90 (67). NF45 has a distant homology to the prokaryotic 
transcription factor D-54; like NF90, it is a substrate of PKR phosphorylation (67). 

15 Western-blots and RMS A with ONF45 antiserum (see Fig. 10 and 1 1) as well as RNA- 
protein coprecipitation experiments confirmed that vRbp45 represents indeed NF45. 

Observations of other laboratories make it very likely that vRbpl20 corresponds 
to RNA helicase A (RHA), which represents a further dsRNA binding protein. RHA is 
suggested to play a pivotal role in the regulation of transcription of the cell (reviewed in 

20 reference 63). This assumption is based on experiments with adenoviral RNAs, which 
associate NF90, NF45 and RNA helicase A. Coprecipitation experiments indicated that 
RNA helicase A (MW ca. 120-130 kDa) is tightly associated, with NF90 and NF45 
(69). Thus, with the exception of vRbpl30 (which is suspected to represent a 
modification of RNA helicase A), the Applicants have direct and indirect evidence of 

25 the identity of the entire set of vRbps. Experiments applying the purified proteins to 
unambiguously confirm the identity of vRbpl30, vRbpl20 as well as that of vRbpllO 
and vRbp64 are in progress. 

The striking similarities of pestiviruses and HCV concerning in particular the 
organization of the 5'UTR and of the NS3 to NS5B coding region, suggest a similar 

30 mode of translation initiation/termination, RNA replication and the coordination of 
both processes (see Fig. 6). Apart from the crosslinking/label transfer data shown in 
Fig. 8, the Applicants accumulated further evidences indicating an important functional 
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role of the here-described vRbp proteins in the life-cycle of HCV. (i) The interaction of 
the vRbps with the HCV IRES was observed to be significantly stimulated if the HCV 
5'UTR acquired four nucleotides "GUAU" (corresponding to the essential part of the 
BVDV hairpin la motif, see Fig. 4) at the S'-terminus (Fig. 12). In agreement with this 
5 observation, chimeric BVDV RNA (BVDV viral genome with the 5*BVDV UTR 
replaced by the HCV 5'UTR) was shown to be unable to replicate; however, upon 
fusion of GUAU to the 5'-end of the HCV 5'UTR, replication of this chimera was 
restored (37). Although these data are difficult to interpret, they support the above idea 
of a hairpin la- IRES core-domain interaction as well as of a vRbp-mediated 

10 crosstalking of the 5' and 3'-end of the viral RNA. (ii) Despite of the aforementioned 
differences between the 3'V regions of the BVDV and HCV 3'UTRs (see Fig. 3), 
genetic data strongly suggest an analogous ftinctional role of the BVDV and the HCV 
3'V portion. Thus, deletion of the HCV SLs to p structure (which includes also the 
pseudo-stop element; see Fig. 3) yielded an RNA derivative, which is replication 

15 deficient (Fig. 6). Intriguingly, a HCV/BVDV chimera where the HCV SL^p was 
substituted by the BVDV SLstop (the latter, which associates the entire set of vRbps 
through the contained UGA box elements), turned out to be replication competent (Fig. 
12). Studies examining whether the Replication deficiency of the HCV DSLstop mutant 
is caused by the read-through of ribosomes of the translational stop-codon are in 

20 progress. 

In sum, these data implicate the here-characterized set of vRbps directly in HCV 
translation and replication. The detailed knowledge on the function of these newly 
identified vRbp factors is hence expected to considerably help to explore particular 
features of the RNA replication pathway, host range and pathogenesis (e.g. the reasons 
25 for chronic infections) of this insidious pathogen. To examine whether the host range of 
HCV (see below) may be defined by the efficiency of RNP formation, the functional, 
chimeric HCV/BVDV replicons are currently tested in terms of their replication 
capability in cells other than human hepatoma cells. 

30 Methods capable of i) modulating the binding of cellular proteins to their 

supposed common binding site on viral RNA, the entire IRES core-domain, or yet 
undefined elements herein, or ii) modulating the biological activity of agonists and 
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antagonists of yet unknown identity, for example viral proteins, would be valuable to 
prevent or treat diseases induced by divergent viruses. 

The present invention relates to a specific interaction between a set of cellular 
proteins and the untranslated regions of a broad range of different viral RNAs. In 
5 particular, interactions involving all known types of viral IRES elements. The 
specificity of the formation of the viral RN A/cellular RNP complex was demonstrated 
by cross-linking and competition data as well as by coprecipitation experiments and 
RNA mobility shift assays, which were performed with individually expressed proteins 
and/or specific antisera, respectively (see Fig. 10 and Fig. 1 1). The identification of 
10 vRbp84 and vRbp45 as NF90/NFAR-1 and NF45 by purification, microsequencing 
and/or biochemical and immunological procedures enabled conclusions on the identity 
of the other vRbps. Thus, vRbp 64 and vRbp 110 were indicated to correspond to 
related isoforms of NF90/NFAR-1, while vRbpl20 was suggested to represent RNA 
helicase A. 

1 5 The Applicants show the importance of vRbp 120 for BVDV and HC V by RNAi 

approaches (see Fig 13). These approaches indicated that HCV viral replication is 
inhibited in vRbpl20 knockouts. As indicated by the data that derived from the BVDV 
and HCV systems, the function(s) of the cellular vRbps appear to be critically 
associated with translation and replication of the viral RNA or the regulation of both 

20 processes. The fact that NF90/NFAR-1 and its relatives as well as NF45 and RHA are 
phosphorylated by PKR and that NF90/NFAR-1 and NF45 bind to PKR, suggests that 
the formation of the viral/cellular RNP may have the task to modulate the function of 
PKR by inhibiting its antiviral activity (see reference 67). Using antisera against the 
vRbpl20, the Applicants were able to perform RMSA and colocalization studies via IF. 

25 Moreover, by IF, the Applicants determined that the NFs and vRbpl20 (RHA) are 
translocated from the nucleus to the cell's cytoplasm in transfected cells. 

The efficiency of the formation of the viral/cellular RNP complex may thus 
represent a molecular determinant of the host-range of the viral RNA. Hence, the 
limited host-range of HCV with respect to pestiviruses may be a consequence of the 

30 low capability of the HCV RNA to assemble the vRbps. 

Taken together, the present invention suggests a universal role of dsRNA 
binding proteins, particularly several members of the NF90 family as well as of NF45 
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and RNA helicase A, in the life cycle of Picornaviruses and Flaviviruses. The 
development of strategies capable to inhibit either binding of the vRbps to then- 
supposed common binding site on the viral RNA, the entire IRES core-domain or yet 
undefined elements herein, or the biological activity of agonists and antagonists of yet 
5 unknown identity, for example viral proteins, would be valuable to treat diseases 
induced by these divergent viruses. 



All documents cited herein and patent applications to which priority is claimed are 
incorporated by reference herein in their entirety. This invention is not to be limited in 
10 scope by the specific embodiments described herein. Indeed, various modifications of 
the invention in addition to those described herein will become apparent to those skilled 
in the art from the foregoing description. Such modifications are intended to fall within 
the scope of the appended claims. The disclosures of the patents, patent applications 
and publications cited herein are incorporated by reference in their entireties. 

15 SEQUENCE LISTING 



vRbpl20 amino acid sequence(SEQ ID NO:l): 



MGDVKNFLYAWCGKRKMIPSYEIRAW 
20 DAQSNAARDFVNYLVRINin 

PHLALKAENNSEVGASGYGWGPTWDRGANL 

GLHGNWTLENAKARI^QYFQ^^ 

REHGSNKKIAAQSCALSLVRQLYHLGV^ 

LQNHQEIJSlLErLPPPEDPSW 
25 NIDEGPIJVFATPEQISMDLKNELM 

WHRGATGCGKTTQWQFILDDnQNDRAAECNIVW 

GKSCGYSVRraSEJPRPHASIMFC^ 

LRDWQAYPEVRIVLMSATTOTSMFCTYFFNCPnEVYGR 

PPKDKKKKDKDDDGGEDDDANCNLICGDEY 
30 ETLNWGAVLVFIJPGWNLIYTMQKHLEMW 

VGVTKVII^TNIAETSIT^ 

KGRAGRSTAGFOFHLCSRARFERI^THMTPEMFRTPL 
EPPPLDAVIEAEHTLRELDALDANDELTPLGRILAKLPIH 
TIAAATCOTEPFINEGKRLGYIHRNFAGNR^ 
35 EHKRLNMATLRMTWEAKVQLJ^IL 

VYPWCTHKEKRHLTIEGRNALfflKSSVNCPFSSQDM 
MTLVPPLQIIJLFASKKVQSDGQrVLVDD^ 

QPAHSQLDPVNERMLNMIRQISRPSAAG1MMIGSTRYGDGPRPPKMARTO 
GGSSYSGGGYGGGYSSGGYGSGGYGGSANSFRAGYGAGVGGGYRGVSRGGFRGNSG 
40 GDYRGPSGGYRGSGGFQRGGGRGAYGTGYFGQGRGGGGY 
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vRbpl20 nucleic acid sequence(SEQ ID NO:2): 

atgggtgacg ttaaaaattt tctgtatgcc tggtgtggca aaaggaagat gaccccatcctatgaaatta gagcagtggg 
gaacaaaaac aggcagaaat tcatgtgtga ggttcaggtggaaggttata attacactgg catgggaaat tccaccaata 
5 aaaaagatgc acaaagcaatgctgccagag actttgttaa ctatttggtt cgaataaatg aaataaagag 
tgaagaagttccagcttttg gggtagcatc tccgccccca cttactgata ctcctgacac tacagcaaatgctgaaggag 
atttaccaac aaccatggga ggacctcttc ctccacatct ggctctcaaagcagaaaata attctgaggt aggggcctct 
ggctatggtg ttcctgggcc cacctgggaccgaggagcca acttgaagga ttactactca agaaaggaag aacaagaagt 
gcaagcgactctagaatcag aagaagtgga tttaaatgct gggcttcatg gaaactggac cttggaaaatgctaaagctc 

10 gtctaaacca atattttcag aaagaaaaga tccaaggaga atataagtacacccaagtgg gtcctgatca caacaggagc 
tttattgcag aaatgaccat ttatatcaagcagctgggca gaaggatttt tgcacgagaa catggatcaa ataagaaatt 
ggcagcacagtcctgtgccc tgtcacttgt cagacaactg taccatcttg gagtggttga agcttactccggacttacaa 
agaagaagga aggagagaca gtggagcctt acaaagtaaa cctctctcaagatttagagc atcagctgca aaacatcatt 
caagagctaa atcttgagat tttgcccccgcctgaagatc cttctgtgcc agttgcactc aacattggca aattggctca 

15 gttcgaaccatctcagcgac aaaaccaagt gggtgtggtt ccttggtcac ctccacaatc caactggaatccttggacta 
gtagcaacat tgatgagggg cctctggctt ttgctactcc agagcaaataagcatggacc tcaagaatga attgatgtac 
cagttggaac aggatcatga tttgcaagcaatcttgcagg agagagagtt actgcctgtg aagaaatttg aaagtgagat 
tctggaagcaatcagccaaa attcagttgt cattattaga ggggctactg gatgtgggaa aaccacacaggttccccagt 
tcattctaga tgactttatc cagaatgacc gagcagcaga gtgtaacatcgtagtaactc agcccagaag aatcagtgcg 

20 gtttctgtgg cagagcgagt tgcatttgaaagaggagaag agcctggaaa aagctgtggc tacagcgttc gatttgagtc 
tatacttcctcgtcctcatg ccagtataat gttttgtact gtaggtgtgc tcctgagaaa attagaagcaggcattcgag gaatcagtca 
tgtaattgta gatgaaatac atgaaagaga tattaatactgacttccttc tggtagtact gcgtgatgtt gttcaggctt atcctgaagt 
tcgcattgttcttatgtctg ctactattga taccagcatg ttttgtgaat atttcttcaa ttgccccatcattgaagttt atgggaggac 
ttacccagtt caagaatatt ttctggaaga ctgcattcagatgacccact ttgttcctcc accaaaagac aaaaagaaga 

25 aggataagga tgatgatggtggtgaggatg atgatgcaaa ttgcaacttg atctgtggtg atgaatatgg 
tccagaaacaaggttgagca tgtctcaatt gaacgaaaag gaaactcctt ttgaactcat cgaggctctacttaagtaca 
ttgaaaccct taatgttcct ggagctgtgt tggttttttt gcctggctggaatctgattt atactatgca gaagcatttg gaaatgaatc 
cacattttgg aagccatcggtatcagattc tacccctgca ttctcagatt cctcgagagg aacagcgcaa 
agtgtttgatccagtaccag ttggagtaac caaggttatt ttgtccacaa atattgctga aacaagcattaccataaacg atgttgttta 

30 tgtcattgac tcctgcaagc agaaagtgaa actcttcactgctcacaaca atatgaccaa ctattctacc gtatgggcat 
caaaaacaaa ccttgagcaacggaaagggc gagctggccg gagtacggct ggattctgct ttcacctgtg 
cagccgagctcgttttgaga , gacttgaaac ccacatgaca ccagagatgt tccgaacacc attgcatgaaattgctctta 
gcataaaact tctgcgtcta. ggaggaattg gccaatttct ggccaaagcaattgaacctc cccctttgga tgctgtgatt 
gaagcagaac acactcttag agagcttgatgcattagatg ccaatgatga gttgactcct ttgggacgaa tcctggctaa 

35 actccccattgagcctcgtt ttggcaaaat gatgataatg gggtgtattt tctacgtggg agatgctatctgtaccattg ctgctgctac 
ctgctttcca gagcctttca tcaatgaagg aaagcggctgggctatatcc atcgaaattt tgctggaaac agattttctg 
atcacgtagc ccttttatcagtattccaag cctgggatga tgctagaatg ggtggagaag aagcagagat 
acgtttttgtgagcacaaaa gacttaatat ggctacacta agaatgacct gggaagccaa agttcagctcaaagagattt tgattaattc 
tgggtttcca gaagattgtt tgttgacaca agtgtttactaacactggac cagataataa tttggatgtt gttatctccc tcctggcctt 

40 tggtgtgtaccccaatgtat gctatcataa ggaaaagagg aagattctca ccactgaagg gcgtaatgcacttatccaca 
aatcatctgt taattgtcct tttagtagcc aagacatgaa gtacccatctcccttctttg tatttggtga aaagattcga actcgagcca 
tctctgctaa aggcatgactttagtacccc ccctgcagtt gcttctcttt gcctccaaga aagtccaatc tgatgggcagattgtgcttg 
tagatgactg gattaaactg caaatatctc atgaagctgc tgcctgtatcactggtctcc gggcagccat ggaggctttg 
gttgttgaag taaccaaaca acctgctatcatcagccagt tggaccccgt aaatgaacgt atgctgaaca tgatccgtca 

45 gatctctagaccctcagctg ctggtatcaa ccttatgatt ggcagtacac ggtatggaga tggtccacgtcctcccaaga 
tggcccgata cgacaatgga agcggatata gaaggggagg ttctagttacagtggtggag gctatggcgg tggctatagc 
agtggaggct atggtagcgg aggctatggtggcagcgcca actcctttcg ggcaggatat ggtgcaggtg ttggtggagg 
ctatagaggagtttcccgag gtggctttag aggcaactct ggaggagact acagagggcc tagtggaggctacagaggat 
ctgggggatt ccagcgagga ggtggtaggg gggcctatgg aactggctactttggacagg gaagaggagg tggcggctat 
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vRbpllO amino acid sequence(SEQ ID NO:3): 
MRPMRIFVNDDRHVMAKHSSVYPTQEELEAV^ 

SEQAESDNMDWPEDDSKEGAGEQKTEHMTRTLRGVMRVGLVAKCIXLKGD 
5 VIIXTKEKPTTAIJLDKV^ 

SPVVREEMEKVIAGETLSVNDPPDVL^^ 
mVLRDLCTRVPTWGPUlGWPIJELLCEKSro^ 
GSGIYDPCEKEATDAIGHLDR(^REDITQSAQHALRLAAFGQLHKV^ 
KPKNENPVDYTVQIPPSTTYAITPMKRPMEEI^ 
10 NALMRIJNQLKPGLQYKLVSQTGPVHAPIFTMSV 

VLQDMGUTGAEGRDSSKGEDSAEETEAKPAWAPAPVVEAVSTPSAAFP 

PILTKHGKNPVMEUraKRRGLK^ 

AKAYAALAAI^KU^DIPIJ^DANKKKRAPW 

EVPPPPNLRGRGRGGSIRGRGRGRGFGGANHGGYMNAGAGYGSYGYGGNSATAGYS 
15 QFYSNGGHSGNASGGGGGGGGGSSGYGSYYQGDNYNSPVPPKHAGKKQPHGGQQKP 
SYGSGYQSHQGQQQSYNQSPYSNYGPPQGKQKGYNHGQGSYSYSNSYNSPGGGGGS 
DYNYESKFNYSGSGGRSGGNSYGSGGASYNPGSHGGYGGGSGGGSSYQGKQGGYSQ 
SNYNSPGSGQNYSGPPSSYQSSQGGYGRNADHSMNYQYR 

20 vRbpl 10 nucleic acid sequence(SEQ ID NO:4): 

atgcgtccaa tgcgaatttt tgtgaatgat gaccgccatg tgatggcaaa gcattcttccgtttatccaa cacaagagga 
gctggaggca gtccagaaca tggtgtccca cacggagcgggcgctcaaag ctgtgtccga ctggatagac gagcaggaaa 
agggtagcag cgagcaggcagagtccgata acatggatgt gcccccagag gacgacagta aagaaggggc 

25 tggggaacagaagacggagc acatgaccag aaccctgcgg ggagtgatgc gggtgggcct ggtggcaaagtgcctcctac 
tcaaggggga cttggatctg gagctggtgc tgctgtgtaa ggagaagcccacaaccgccc tcctggacaa ggtggccgac 
. aacctggcca tccagcttgc tgctgtaacagaagacaagt acgaaatact gcaatctgtc gacgatgctg cgattgtgat 
aaaaaacacaaaagagcctc cattgtccct gaccatccac ctgacatccc ctgttgtcag agaagaaatggagaaagtat 
tagctggaga aacgctatca gtcaacgacc ccccggacgt tctggacaggcagaaatgcc ttgctgcctt ggcgtccctc 

30 cgacacgcca agtggttcca ggccagagccaacgggctga agtcttgtgt cattgtgatc cgggtcttga gggacctgtg 
cactcgcgtgcccacctggg gtcccctccg aggctggcct ctcgagctcc tgtgtgagaa atccattggcacggccaaca 
gaccgatggg. tgctggcgag gccctgcgga gagtgctgga gtgcctggcgtcgggcatcg tgatgccaga tggttctggc 
atttatgacc cttgtgaaaa agaagccactgatgctattg ggcatctaga cagacagcaa cgggaagata tcacacagag 
tgcgcagcacgcactgcggc tcgctgcctt cggccagctc cataaagtcc taggcatgga ccctctgccttccaagatgc 

35 ccaagaaacc aaagaatgaa aacccagtgg actacaccgt tcagatcccaccaagcacca cctatgccat tacgcccatg 
aaacgcccaa tggaggagga cggggaggagaagtcgccca gcaaaaagaa gaagaagatt cagaagaaag aggagaaggc 
agagcccccccaggctatga atgccctgat gcggttgaac cagctgaagc cagggctgca gtacaagctggtgtcccaga 
ctgggcccgt ccatgccccc atctttacca tgtctgtgga ggttgatggcaattcattcg aggcctctgg gccctccaaa 
aagacggcca agctgcacgt ggccgttaaggtgttacagg acatgggctt gccgacgggt gctgaaggca gggactcgag 

40 caagggggaggactcggctg aggagaccga ggcgaagcca gcagtggtgg cccctgcccc agtggtagaagctgtctcca 
cccctagtgc ggcctttccc tcagatgcca ctgccgagca ggggccgatcctgacaaagc acggcaagaa cccagtcatg 
gagctgaacg agaagaggcg tgggctcaagtacgagctca tctccgagac cgggggcagc cacgacaagc gcttcgtcat 
ggaggtcgaagtggatggac agaagttcca aggtgctggt tccaacaaaa aggtggcgaa ggcctacgctgctcttgctg 
ccctagaaaa gcttttccct gacacccctc tcgcccttga tgccaacaaaaagaagagag ccccagtacc cgtcagaggg 

45 ggaccgaaat ttgctgctaa gccacataaccctggcttcg gcatgggagg ccccatgcac aacgaagtgc ccccaccccc 
caaccttcgagggcggggaa gaggcgggag catccgggga cgagggcgcg ggcgaggatt tggtggcgccaaccatggag 
gctacatgaa tgccggtget gggtatggaa gctatgggta cggaggcaactctgcgacag caggctacag tcagttctac 
agcaacggag ggcattctgg gaatgccagtggcggtggcg gcgggggcgg tggtggctcc tccggctatg gctcctacta 
ccaaggtgacaactacaact caccggtgcc cccaaaacac gctgggaaga agcagccgca cgggggccagcagaagccct 

50 cctacggctc gggctaccag tcccaccagg gccagcagca gtcctacaaccagagcccct acagcaacta tggccctcca 
cagggcaagc agaaaggcta taaccatggacaaggcagct actcctactc gaactcctac aactctcccg ggggcggggg 
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cggatccgactacaactacg agagcaaatt caactacagt ggtagtggag gccgaagcgg cgggaacagctacggctcag 
gcggggcatc ctacaaccca gggtcacacg ggggctacgg cggaggttctgggggcggct cctcatacca aggcaaacaa 
ggaggctact cacagtcgaa ctacaactccccggggtccg gccagaacta cagtggccct cccagctcct accagtcctc 
^ acaaggcggctatggcagaa acgcagacca cagcatgaac taccagtaca gataa 

vRbp84 amino acid sequence(SEQ ID N0:5): 
MRPMRIFVNDDRHVMAK^ 

SEQAESDNMDWPEDDSKEGAGEQKTEHMTRTLRGVMRVGLVAKQXLKGDLDLEL 

VIXCKEKPTTAIXDKVADM-MQLAAVTEDKYEILQSW 
10 SPVVREEMEKVLAGETI^V^ 

IRVLRDIJjmVPTWGPLRGWPl^LU^KSIGTANI^ 

GSGIYDPCEKEATDAIGHLDRQQREDITQSAQHALRIAAFG^ 

KPKMENPVDYTVQIPPSTTYArrPMKRPMEED 

NALMRLNQLKPGLQYKLVSQTGFVHAPIFIMSVEVDGNSFEA^ 
15 V1XJDMGLFTGAEGRDSSKGEDSAEETEAKPAWAPAPVVEAVSTPSAAFPSDATAEQG 

PILTKHGKNPVMEIJreKRRGLKY^^ 

AKAYAAIAALEKLFPDTPLALDANKKKRAPVPVR^ 

EWPPPM^RGRGRGGSIRGRGRGRGFGGANHGGYMNAGAGYGSYGYGGNSATAGYS 
DFFTDCYGYHDFGSS 

20 

vRbp84 nucleic acid sequence(SEQ ID N0:6): 



atgcgtccaa tgcgaatttt tgtgaatgat gaccgccatg tgatggcaaa gcattcttccgtttatccaa cacaagagga 
gctggaggca gtccagaaca tggtgtccca cacggagcgggcgctcaaag ctgtgtccga ctggatagac gagcaggaaa 

25 agggtagcag cgagcaggcagagtccgata acatggatgt gcccccagag gacgacagta aagaaggggc 
tggggaacagaagacggagc acatgaccag aaccctgcgg ggagtgatgc gggtgggcct ggtggcaaagtgcctcctac 
tcaaggggga cttggatctg gagctggtgc tgctgtgtaa ggagaagcccacaaccgccc tcctggacaa ggtggccgac 
aacctggcca tccagcttgc tgctgtaacagaagacaagt acgaaatact gcaatctgtc gacgatgctg cgattgtgat 
aaaaaacacaaaagagcctc cattgtccct gaccatccac ctgacatccc ctgttgtcag agaagaaatggagaaagtat 

30 tagctggaga aacgctatca gtcaacgacc ccccggacgt tctggacaggcagaaatgcc ttgctgcctt ggcgtccctc 
cgacacgcca agtggttcca ggccagagccaacgggctga agtcttgtgt cattgtgatc cgggtcttga gggacctgtg 
cactcgcgtgcccacctggg gtcccctccg aggctggcct ctcgagctcc tgtgtgagaa atccattggcacggccaaca 
gaccgatggg tgctggcgag gccctgcgga gagtgctgga gtgcctggcgtcgggcatcg tgatgccaga tggttctggc 
atttatgacc cttgtgaaaa agaagccactgatgctattg ggcatctaga cagacagcaa cgggaagata tcacacagag 

35 tgcgcagcacgcactgcggc tcgctgcctt cggccagctc cataaagtcc taggcatgga ccctctgccttccaagatgc 
ccaagaaacc aaagaatgaa aacccagtgg actacaccgt tcagatcccaccaagcacca cctatgccat tacgcccatg 
aaacgcccaa tggaggagga cggggaggagaagtcgccca gcaaaaagaa gaagaagatt cagaagaaag aggagaaggc 
agagcccccccaggctatga atgccctgat gcggttgaac cagctgaagc cagggctgca gtacaagctggtgtcccaga 
ctgggcccgt ccatgccccc atctttacca tgtctgtgga ggttgatggcaattcatteg aggcctctgg gccctccaaa 

40 aagacggcca agctgcacgt ggccgttaaggtgttacagg acatgggctt gccgacgggt gctgaaggca gggactcgag 
caagggggaggactcggctg aggagaccga ggcgaagcca gcagtggtgg cccctgcccc agtggtagaagctgtctcca 
cccctagtgc ggcctttccc tcagatgcca ctgccgagca ggggccgatc!561ctgacaaagc acggcaagaa cccagtcatg 
gagctgaacg agaagaggcg tgggctcaagtacgagctca tctccgagac cgggggcagc cacgacaagc gcttcgtcat 
ggaggtcgaagtggatggac agaagttcca aggtgctggt tccaacaaaa aggtggcgaa ggcctacgctgctcttgctg 

45 ccctagaaaa gcttttccct gacacccctc gcccttga tgccaacaaaaagaagagag ccccagtacc cgtcagaggg 
ggaccgaaat ttgctgctaa gccacataaccctggcttcg gcatgggagg ccccatgcac aacgaagtgc ccccaccccc 
caaccttcgagggcggggaa gaggcgggag catccgggga cgagggcgcg ggcgaggatt tggtggcgccaaccatggag 
gctacatgaa tgccggtgct gggtatggaa gctatgggta cggaggcaactctgcgacag caggctacag tgactttttc 
acagactgct acggctatca tgattttgggtcttcctag 

50 
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vRbp45 amino acid sequence (SEQ ID NO:7): 

MRGDRGRGRGGRFGSRGGPCKKjFRPF^ 
APNSAEQASII^LVTKINNVIDNLIVAPGTFEVQIEEVRQV 
5 T1JBAVAALGNKVVESLRAQDPSEVLTMLTNETGFEISS 
VLQSALAAmHARWFEENASQSTVKVIJG^ 
LAIJWAYRRCLQILAAGIJ^PGSVGITO 
FRKIUjQEGDASYLASEISTWDGVIVTPSEKAYEK^^ 
VTFPSLLFLPKGKTGA 

10 

vRbp45 nucleic acid sequence (SEQ ID NO:8): 

atgaggggtg acagaggccg tggtcgtggt gggcgctttg gttccagagg aggcccaggaggagggttca ggccctttgt 
15 accacatatc ccatttgact tctatttgtg tgaaatggcctttccccggg tcaagccagc acctgatgaG acttccttca 
gtgaggcctt gctgaagaggaaCcaggacc tggctcccaa ttctgctgaa caggcatcta tcctttctct 
Agtgacaaaaataaacaatg tgattgataa tctgattgtg gctccaggga catttgaagt gcaaattgaagaagttcgac 
aggtgggatc ctataaaaag gggacaatga ctacaggaca caatgtggctgacctggtgg tgatactcaa gattctgcca 
acgttggaag ctgttgctgc cctggggaacaaagtcgtgg aaagcctaag agcacaggat ccttctgaag ttttaaccat 
20 gctgaccaacgaaacAggct ttgaaatcag ttcttctgat gctacagtga agattctcat tacaacagtgccacccaatc 
ttcgaaaact ggatccagaa ctccatttgg atatcaaagt attgcagagtgccttagcag ccatccgaca tgcccgctgg 
ttcgaggaaa atgcttctca gtccacagttaaagttctca tcagactact gaaggacttg aggattcgtt ttccCggctt 
tgagcccctcacaccctgga tccttgacct actaggccat tatgctgtga tgaacaaccc caccagacagcctttggccc 
taaacgttgc atacaggcgc tgcttgcaga ttctggctgc aggactgttcctgccaggtt cagtgggtat cactgacccc 
25 tgtgagagtg gcaactttag agtacacacagtcatgaccc tagaacagca ggacatggtc tgctatacag ctcagactct 
cgtccgaatcctctcacatg gtggctttag gaagatcctt ggccaggagg gtgatgccag ctatcttgcttctgaaatat 
ctacctggga tggagtgata gtaacacctt cagaaaaggc ttatgagaagccaccagaga agaaggaagg agaggaagaa 
gaggagaata cagaaagaac cacctcaaggagaggaagaa gaaagcatgg aaactcagga gtgacattcc cttcactcct 
tttcctacccaagggaaaga ctggagccta a 

30 
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What is claimed is: 

1. A method for modulating viral RNA replication and translation, in a eukaryotic cell, of 
positive-strand viral RNA, comprising the step of contacting a viral RNA-binding protein 

5 (vRbp) with a compound that modulates an activity of said vRbp. 

2. The method of claim 1, wherein said vRbp is selected from the group consisting of: vRbpl30, 
vRbpl20, vRbpl 10, vRbp84, vRbp64, and vRbp45. 

10 3. The method of claim 1 wherein said activity of the vRbp is selected from the group 
consisting of: 

a response to viral RNA, 

a response to interferon induction, 

a response to double-stranded RNA-dependent protein kinase (PKR), and 
15 a response to vRbp. 

4. The method of claim 3 wherein said response is formation of a viralxellular 
ribonucleoprotein (RNP) complex. 

20 5. The method of claim 4 wherein said RNP complex comprises a viral RNA:vRbp interaction. 

6. The method of claim 5 wherein said viral RNA:vRbp interaction comprises binding of a vRbp 
to a viral RNA 3* untranslated region (3UTR). 

25 7. The method of claim 4 wherein said viral RNA:vRbp interaction comprises binding of a vRbp 
to a viral RNA 5' untranslated region (5UTR). 

8. The method of claim 5 wherein said 3UTR is a UGA box consensus sequence. 

30 9. The method of claim 3 wherein said response is viral rircularization. 

10. The method of claim 9 wherein said viral circularization comprises binding of vRbp to the 
viral 5*UTR and 3'UTR creating a physical and functional link between both ends of the 
RNA. 

35 

51 



WO 2004/029199 



PCT/US2003/028654 



1 1. The method of claim 9 wherein said viral circularization comprises an interaction between 
viral 5UTR, 3TJTR RNA, vRbp, and cellular proteins involved in the interferon antiviral 
response, 

5 12. The method of claim 3 wherein said response is increase in translational frameshif ting that 
result in decreased viral replication. 

13. Hie method of claim 3 wherein said response is fonnation of a vRbp:PKR interaction. 

10 14. The method of claim 1 wherein said viral replication and translation comprises coordinated 
regulation of replication and translation of viral RNA. 

15. The method of claim 1, wherein said eukaryotic cell is a mammalian cell. 

15 16. The method of claim 1, wherein said eukaryotic cell is a human cell. 

17. The method of claim 1, wherein said eukaryotic cell is a liver cell. 

18. The method of claim 1, wherein said positive strand viral RNA comprises RNA from a 
20 member of the family Flaviviridae. 

19. The method of claim 1 wherein said positive strand viral RNA comprises RNA from a 
member of the family Picornaviridae. 

25 20. The method of claim 1 wherein said compound comprises therapeutically effective amounts 
of viral 3TJTR, fragments thereof, or pharmaceutical^ acceptable derivatives thereof. 

21. The method of claim 1 wherein vRbp activity is reduced by interfering with the interaction 
between vRbp and vRbp recognition sites on viral RNA. 

30 

22. The method of claim 1 wherein vRbp activity is reduced by modification of a viral 3UTR, 
which modification otherwise reduces vRbp binding to vRbp recognition sites on viral 
RNA. 
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23. The method of claim 1 wherein vRbp activity is reduced by inhibiting dissociation of viral 
RNA:vRbp complexes. 

24. A method for reducing the effects of viral infection on eukaryotic cells, comprising 

5 inhibiting vRbp activity in the cell such that viral replication and translation of viral RNA 

is regulated by interactions between vRbp and said viral RNA, comprising introducing a 
nucleic acid decoy molecule into the cell in an amount sufficient to inhibit viral RNA:vRbp 
interactions, which decoy includes a vRbp recognition site that binds to vRbp. 

10 25. A method for reducing the effects of viral infection on eukaryotic cells, comprising 

inhibiting vRbp activity in the cell such that viral replication and translation of viral RNA 
is regulated by interactions between vRbp and PKR, comprising introducing a nucleic acid 
decoy molecule into the cell in an amount sufficient to inhibit vRbp:PKR interactions, 
which decoy includes a vRbp recognition site that binds to vRbp. 

15 

26. A method for reducing the effects of viral infection on eukaryotic cells, comprising the step 
of reducing vRbp activity in the cell such that viral replication and translation is reduced. 

27. A method for reducing the effects of viral infection on eukaryotic cells, the method 
20 comprising the step of reducing vRbp activity in the cell such that production of novel 

infectious virus particles is reduced. 

28. A method for reducing the effects of viral infection on eukaryotic cells, the method 
comprising the steps of reducing vRbp activity in the cell to inhibit the spread of virus in 

25 infected individuals and animals. 

29. A method for reducing the effects of viral infection on eukaryotic cells, the method 
comprising the steps of reducing vRbp activity in the cell to prevent the spread of virus 
between different individuals and animals. 

30 

30. A method for reducing the effects of viral infection on eukaryotic cells, the method 
comprising the steps of reducing vRbp activity in the cell to treat syndromes caused by co- 
infection of different viruses. 
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31. The method of claim 30 wherein different viruses comprise HCV and HBV or HCV and 
HIV. 

32. A method for reducing the effects of viral infection on eukaryotic cells, the method 

5 comprising the steps of reducing vRbp activity in the cell to treat before, during, and after a 

transplantation. 

33. A method for reducing the effects of viral infection on eukaryotic cells, the method 
comprising the steps of modulating vRbp activity in the cell to treat immunosuppressed 

10 patients to prevent virus infections. 

34. A method for reducing the effects of viral infection, in a eukaryotic cell, by modulating 
vRbp activity in the cell, the method comprising the step of interfering with viral 
translation termination as a mechanism to disrupt viral replication. 

15 

35. A method for reducing the effects of viral infection, in a eukaryotic cell, by modulating 
viral RNA-binding protein (vRbp) activity in the cell, the method comprising the step of 
interfering with interactions between viral 3TJTR and 5TJTR, or interactions between 
structural elements within the 3TJTR and NS5B stop codon as a mechanism to regulate 

20 translation termination, translation^ frameshifting, and the coordinated balance of 

replication and translation on positive strand RNA. 

36. A method of treating or preventing a viral infection by a virus comprising the step of 
administering a therapeutically effective amount of a compound to an individual suspected 

25 of having or being at risk of having an infection with a virus. 

37. The method of claim 35 wherein said positive strand viral RNA comprises RNA from a 
member of the family Flaviviridae. 

30 38. The method of claim 35 wherein said positive strand viral RNA comprises RNA from a 
member of the family Picornaviridae. 

39. The method of claim 36 wherein said virus is selected from the group consisting of: hepatitis 
A virus (HAV), hepatitis C virus (HCV), human Rhinovirus (HRV), bovine viral diarrhea 
35 virus (BVDV), and classical swine fever viras(CSFV). 
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40. The method of claim 36 wherein said compound interacts with viral genomic 3TJTR or 5TJTR 
RNA. 

5 41. A method for modulating the function of a viral 3UTR comprising the step of contacting 
a 3UTR with a compound that modulates the structure of the 3TJTR as to inhibit the 
interaction between 3UTR and vRbp. 

42. A method for screening to identify compounds that activate or that inhibit the function of 
10 vRbp which comprises a method selected from the group consisting of: 

(a) mixing a candidate compound with a solution containing a vRbp, to form a 
mixture, measuring activity of tbd vRbp in the mixture, and comparing the activity of 
the mixture to a standard; 

(b) detecting the effect of a candidate compound on the production of viral RNA in a 
15 eukaryotic cell, using for instance, an ELISA assay, reticulocyte lysate translation 

assay (luciferase RNA); and 

(c) (1) contacting a composition comprising the vRbp with the compound to be 
screened under conditions to permit interaction between the compound and the vRbp to 
assess the interaction of a compound, such interaction being associated with a second 

20 component capable of providing a detectable signal in response to the interaction of the 

vRbp with the compound; and 

(2) detenriining whether the compound interacts with and activates or inhibits an 
activity of the vRbp by detecting the presence or absence of a signal generated from the 
interaction of the compound with the vRbp. 

25 

43. A method for screening to identify compounds that increase translational frameshifting 
resulting in decreased replication of viral RNA comprising a method selected from the 
group consisting of: 

(a) mixing a candidate compound with a solution containing a vRbp, to form a mixture, 
30 measuring activity of the vRbp in the mixture, and comparing the activity of the 

mixture to a standard; and 

(b) detecting the effect of a candidate compound on the production of viral RNA in a 
eukaryotic cell, using for instance, an ELISA assay, reticulocyte lysate translation 
assay (luciferase RNA). 
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44. The method of claim 43 wherein said viral RNA is positive strand viral RNA. 

45. The method of claim 44 wherein said positive strand viral RNA comprises RNA from a 
member of the family Flaviviridae. 

5 

46. The method of claim 44 wherein said positive strand viral RNA comprises RNA from a 
member of the family Picornaviridae. 
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Fig. 1 
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Fig. 2 



Organization of BVDV and HCV subgenomic RNA replicons 
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Fig. 3 



Structure and functional features of the BVDV and HCV 3TJTR 
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? ig.4 

Structure and functional features of the BVDV and HCV 5*DTR 
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Fig. 5A 



A set of cellular proteins binds specifically 
totheBVDV3'UTR 
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Fig, SB 



The cellular factors are ubiquitous; 
they associate also with the CSFV 3'UTR 
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Fig. 5C 



Competition assay 
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5g. 5E 



The entire set of cellular proteins binds to a single 
VGA box element; binding is specific 
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C Regulation of translation versus replication of tbe genome 
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Fig. 8 



Different viral IRES elements recruit the same set of 
cellular proteins 
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Fig. 10 



Some of the viral RNA binding proteins are members of the NFAT/NF/NFAR family 
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Purification procedure 
Mela cytoplasmic (SIO) extract 

1. ammoniumsulfate fractionation (25-30% to/v> 

2- heparine sepharose (flow thru) 

3. MonoQ sepharose (50mM-lM KCI) 




ca. 35 O - 420 mM 
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Fig. 11 



RMSA/RNA-protem coprecipitation experiments 
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Fig. 12A 



Functional BVDV/HCV and HCV/BVDV chimera 



hairpin IA 
BVDV 



G A 

A A 

U G . 

V G 

A C 

A A 

g-c 

A-U 

G-C 

C-G 

A-U 

U-A 

A-U • 

U-A 

G-C 



hotrpin IA 
HCV(mod) 



AU 
G V 
C-G 
C-G 
C-G 
C-G 
C-G 

Gl/AUGCCAG-C 



it* 

$ $ $ 

/ft 



pi 10. p!20, pi 30 
p84 

p64, p67» 
PTB 



UV crosslink 



HCV 3 'UTR 
(mod) 



BVDV UGA boxes 



stop 



):.:,;:O.RF;l:;:,I 



ASLstop 




pll0.pl20.pl30 



18/20 



WO 2004/029199 PC17US2003/028654 



ig. 12B 



BVDV and HCV 3'UTR; HCV mutants 
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SEQUENCE. LISTING 

<110> SmithKline Beecham Corporation 

<120> A Set of Ubiquitous Cellular Proteins 
Involved in Viral Life Cycle 



<130> P51375 

<140> Unassigned 
<141> Herewith 

<150> 60/410,460 
<151> 2002-09-13 

<160> 8 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 1270 
<212> PRT 
<213> Homo sapien 

<400> 1 

Met Gly Asp Val Lys Asn Phe Leu Tyr Ala Trp Cys Gly Lys Arg Lys 

1 5 . 10 15 

Met Thr Pro Ser Tyr Glu lie Arg Ala Val Gly Asn Lys Asn Arg Gin 

20 25 30 

Lys Phe Met Cys Glu Val Gin Val Glu Gly Tyr Asn Tyr Thr Gly Met 

35 40 45 

Gly Asn Ser Thr Asn Lys Lys Asp Ala Gin Ser Asn Ala Ala Arg Asp 

50 55 60 

Phe Val Asn Tyr Leu Val Arg lie Asn Glu lie Lys- Ser Glu Glu Val 
65 70 75 80 

Pro Ala Phe Gly Val Ala Ser Pro Pro Pro Leu Thr Asp Thr Pro Asp 

85 90 95 

Thr Thr Ala Asn Ala Glu Gly Asp Leu Pro Thr Thr Met Gly Gly Pro 

100 105 110 

Leu Pro Pro His Leu Ala Leu Lys Ala Glu Asn Asn Ser Glu Val Gly 
115 120 125 
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Ala Ser Gly Tyr Gly Val Pro Gly Pro Thr Trp 
130 135 



Asp Arg Gly Ala Asn 
140 



Leu Lys Asp Tyr Tyr Ser Arg Lys Glu Glu Gin Glu Val Gin Ala Thr 
145 150 155 160 

Leu Glu Ser Glu Glu Val Asp Leu Asn Ala Gly Leu His Gly Asn Trp 

165 170 175 

Thr Leu Glu Asn Ala Lys Ala Arg Leu Asn Gin Tyr Phe Gin Lys Glu 

180 185 190 

Lys He Gin Gly Glu Tyr Lys Tyr Thr Gin Val Gly Pro Asp His Asn 

195 200 205 

Arg Ser Phe He Ala Glu Met Thr He Tyr He Lys Gin Leu Gly Arg 

210 215 220 

Arg He Phe Ala Arg Glu His Gly Ser Asn Lys Lys Leu Ala Ala Gin 
225 230 235 240 

Ser Cys Ala Leu Ser Leu Val Arg Gin Leu Tyr His Leu Gly Val Val 

245 250 255 

Glu Ala Tyr Ser Gly Leu Thr Lys Lys Lys Glu Gly Glu Thr Val Glu 

260 265 270 

Pro Tyr Lys Val Asn Leu Ser Gin Asp Leu Glu His Gin Leu Gin Asn 

275 280 285 

He He Gin Glu Leu Asn Leu Glu He Leu Pro Pro Pro Glu Asp Pro 

290 295 300 

Ser Val Pro Val Ala Leu Asn He Gly Lys Leu Ala Gin Phe Glu Pro 
305 310 315 320 

Ser Gin Arg Gin Asn Gin Val Gly Val Val Pro Trp Ser Pro Pro Gin 

325 330 335 

Ser Asn Trp Asn Pro Trp Thr Ser Ser Asn He Asp Glu Gly Pro Leu 

340 345 350 

Ala Phe Ala Thr Pro Glu Gin He Ser Met Asp Leu Lys Asn Glu Leu 

355 360 365 

Met Tyr Gin Leu Glu Gin Asp His Asp Leu Gin Ala He Leu Gin Glu 

370 375 380 

Arg Glu Leu Leu Pro Val Lys Lys Phe Glu Ser Glu He Leu Glu Ala 
385 390 395 400 

He Ser Gin Asn Ser Val Val He He Arg Gly Ala Thr Gly Cys Gly 

405 410 415 

Lys Thr Thr Gin Val Pro Gin Phe He Leu Asp Asp Phe He Gin Asn 

420 425 430 

Asp Arg Ala Ala Glu Cys Asn He Val Val Thr Gin Pro Arg Arg He 

435 440 445 

Ser Ala Val Ser Val Ala Glu Arg Val Ala Phe Glu Arg Gly Glu Glu 

450 455 460 

Pro Gly Lys Ser Cys Gly Tyr Ser Val Arg Phe Glu Ser He Leu Pro 
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465 470 
Arg Pro His Ala Ser He 
485 

Lys Leu Glu Ala Gly lie 
500 

He His Glu Arg Asp He 
515 

Asp Val Val Gin Ala Tyr 
530 

Thr He Asp Thr Ser Met 
545 550 
He Glu Val Tyr Gly Arg 
565 

Asp Cys He Gin Met Thr 
580 

Lys Lys Asp Lys Asp Asp 
595 

Asn Leu He Cys Gly Asp 
610 

Ser Gin Leu Asn Glu Lys 
625 630 
Leu Lys Tyr He Glu Thr 
645 

Leu Pro Gly Trp Asn Leu 
660 

Asn Pro His Phe Gly Ser 
675 

Gin He Pro Arg Glu Glu 
690 

Gly Val Thr Lys Val He 
705 710 
Thr lie Asn Asp Val Val 
725 

Lys Leu Phe Thr Ala His 
740 

Ala Ser Lys Thr Asn Leu 
755 

Thr Ala Gly Phe Cys Phe 
770 

Leu Glu Thr His Met Thr 
785 790 
He Ala Leu Ser He Lys 
805 



475 

Met Phe Cys Thr Val Gly 
4 90 

Arg Gly He Ser His Val 
505 

Asn Thr Asp Phe Leu Leu 
520 

Pro Glu Val Arg He Val 
535 540 
Phe Cys Glu Tyr Phe Phe 
555 

Thr Tyr Pro Val Gin Glu 
570 

His Phe Val Pro Pro Pro 
585 

Asp Gly Gly Glu Asp Asp 
600 

Glu Tyr Gly Pro Glu Thr 
615 620 
Glu Thr Pro Phe Glu Leu 
635 

Leu Asn Val Pro Gly Ala 
650 

He Tyr Thr Met Gin Lys 
665 

His Arg Tyr Gin He Leu 
680 

Gin Arg Lys Val Phe Asp 
695 700 
Leu Ser Thr Asn He Ala 
715 

Tyr Val He Asp Ser Cys 
730 

Asn Asn Met Thr Asn Tyr 
745 

Glu Gin Arg Lys Gly Arg 
760 

His Leu Cys Ser Arg Ala 
775 780 
Pro Glu Met Phe Arg Thr 
795 

Leu Leu Arg Leu Gly Gly 
810 



480 

Val Leu Leu Arg 
495 

He Val Asp Glu 
510 

Val Val Leu Arg 
525 

Leu Met Ser Ala 

Asn Cys Pro He 
560 

Tyr Phe Leu Glu 
575 

Lys Asp Lys Lys 
590 

Asp Ala Asn Cys 
605 

Arg Leu Ser Met 

He Glu Ala Leu 
640 

Val Leu Val Phe 
655 

His Leu Glu Met 
670 

Pro Leu His Ser 
685 

Pro Val Pro Val 

Glu Thr Ser He 
720 

Lys Gin Lys Val 
735 

Ser Thr Val Trp 
750 

Ala Gly Arg Ser 
765 

Arg Phe Glu Arg 

Pro Leu His Glu 
800 

He Gly Gin Phe 
815 
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Leu Ala Lys Ala lie Glu Pro Pro Pro Leu Asp Ala Val lie Glu Ala 

820 825 830 

Glu His Thr Leu Arg Glu Leu Asp Ala Leu Asp Ala Asn Asp Glu Leu 

835 840 845 

Thr Pro Leu Gly Arg He Leu Ala Lys Leu Pro He Glu Pro Arg Phe 

850 855 860 

Gly Lys Met Met He Met Gly Cys He Phe Tyr Val Gly Asp Ala He 
865 870 875 880 

Cys Thr He Ala Ala Ala Thr Cys Phe Pro Glu Pro Phe He Asn Glu 

885 890 895 

Gly Lys Arg Leu Gly Tyr He His Arg Asn Phe Ala Gly Asn Arg Phe 

900 905 910 

Ser Asp His Val Ala Leu Leu Ser Val Phe Gin Ala Trp Asp Asp Ala 

915 920 925 

Arg Met Gly Gly Glu Glu Ala Glu lie Arg Phe Cys Glu His Lys Arg 

930 935 940 

Leu Asn Met Ala Thr Leu Arg Met Thr Trp Glu Ala Lys Val Gin Leu 
945 950 955 960 

Lys Glu He Leu He Asn Ser Gly Phe Pro Glu Asp Cys Leu Leu Thr 

965 970 975 

Gin Val Phe Thr Asn' Thr Gly Pro Asp Asn Asn Leu Asp Val Val He 

980 985 990 

Ser Leu Leu Ala Phe Gly Val Tyr Pro Asn Val Cys Tyr His Lys Glu 

995 1000 1005 

Lys Arg Lys He Leu Thr Thr Glu Gly Arg Asn Ala Leu He His Lys 
1010 1015 1020 

. Ser Ser Val Asn Cys Pro Phe Ser Ser Gin Asp Met Lys Tyr Pro Ser 
1025 1030 1035 1040 

Pro Phe Phe Val Phe Gly Glu Lys He Arg Thr Arg Ala He Ser Ala 

1045 1050 1055 

Lys Gly Met Thr Leu Val Pro Pro Leu Gin Leu Leu Leu Phe Ala Ser 

1060 1065 1070 

Lys Lys Val Gin Ser Asp Gly Gin lie Val Leu Val Asp Asp Trp He 

1075 1080 1085 

Lys Leu Gin He Ser His Glu Ala Ala Ala Cys He Thr Gly Leu Arg 

1090 1095 1100 

Ala Ala Met Glu Ala Leu Val Val Glu Val Thr Lys Gin Pro Ala He 
1105 1110 1115 1120 

He Ser Gin Leu Asp Pro Val Asn Glu Arg Met Leu Asn Met He Arg 

1125 1130 1135 

Gin He Ser Arg Pro Ser Ala Ala Gly He Asn Leu Met He Gly Ser 

1140 1145 1150 

Thr Arg Tyr Gly Asp Gly Pro Arg Pro Pro Lys Met Ala Arg Tyr Asp 
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1155 1160 1165 

Asn Gly Ser Gly Tyr Arg Arg Gly Gly Ser Ser Tyr Ser Gly Gly Gly 

1170 1175 1180 

Tyr Gly Gly Gly Tyr Ser Ser Gly Gly Tyr Gly Ser Gly Gly Tyr Gly 
1185 1190 1195 1200 

Gly Ser Ala Asn Ser Phe Arg Ala Gly Tyr Gly Ala Gly Val Gly Gly 

1205 1210 1215 

Gly Tyr Arg Gly Val Ser Arg Gly Gly Phe Arg Gly Asn Ser Gly Gly 

1220 1225 1230 

Asp Tyr Arg Gly Pro Ser Gly Gly Tyr Arg Gly Ser Gly Gly Phe Gin 

1235 1240 1245 

Arg Gly Gly Gly Arg Gly Ala Tyr Gly Thr Gly Tyr Phe Gly Gin Gly 

1250 1255 1260 

Arg Gly Gly Gly Gly Tyr 
1265 1270 



<210> 2 
<211> 3810 
<212> DNA 
<213> Home sapien 

<400> 2 

atgggtgacg ttaaaaattt tctgtatgcc 
tatgaaatta gagcagtggg gaacaaaaac 
gaaggttata attacactgg catgggaaat 
gctgccagag actttgttaa ctatttggtt 
ccagcttttg gggtagcatc tccgccccca 
gctgaaggag atttaccaac aaccatggga 
gcagaaaata attctgaggt aggggcctct 
cgaggagcca acttgaagga ttactactca 
ctagaatcag aagaagtgga tttaaatgct 
gctaaagctc gtctaaacca atattttcag 
acccaagtgg gtcctgatca caacaggagc 
cagctgggca gaaggatttt tgcacgagaa 
tcctgtgccc tgtcacttgt cagacaactg 
ggacttacaa agaagaagga aggagagaca 
gatttagagc atcagctgca aaacatcatt 
cctgaagatc cttctgtgcc agttgcactc 
tctcagcgac aaaaccaagt gggtgtggtt 
ccttggacta gtagcaacat tgatgagggg 
agcatggacc tcaagaatga attgatgtac 
atcttgcagg agagagagtt actgcctgtg 



tggtgtggca aaaggaagat gaccccatcc 60 
aggcagaaat tcatgtgtga ggttcaggtg 120 
tccaccaata aaaaagatgc acaaagcaat 180 
cgaataaatg aaataaagag tgaagaagtt 240 
cttactgata ctcctgacac tacagcaaat 300 
ggacctcttc ctccacatct ggctctcaaa 360 
ggctatggtg ttcctgggcc cacctgggac 420 
agaaaggaag aacaagaagt gcaagcgact 480 
gggcttcatg gaaactggac cttggaaaat 540 
aaagaaaaga tccaaggaga atataagtac 600 
tttattgcag aaatgaccat ttatatcaag 660 
catggatcaa ataagaaatt ggcagcacag 720 
taccatcttg gagtggttga agcttactcc 780 
gtggagcctt acaaagtaaa cctctctcaa 840 
caagagctaa atcttgagat tttgcccccg 900 
aacattggca aattggctca gttcgaacca 960 
ccttggtcac ctccacaatc caactggaat 1020 
cctctggctt ttgctactcc agagcaaata 1080 
cagttggaac aggatcatga tttgcaagca 1140 
aagaaatttg aaagtgagat tctggaagca 1200 
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atcagccaaa attcagttgt cattattaga ggggctactg gatgtgggaa aaccacacag 1260 
gttccccagt tcattctaga tgactttatc cagaatgacc gagcagcaga gtgtaacatc 1320 
gtagtaactc agcccagaag aatcagtgcg gtttctgtgg cagagcgagt tgcatttgaa 1380 
agaggagaag agcctggaaa aagctgtggc tacagcgttc gatttgagtc tatacttcct 1440 
cgtcctcatg ccagtataat gttttgtact gtaggtgtgc tcctgagaaa attagaagca 1500 
ggcattcgag gaatcagtca tgtaattgta gatgaaatac atgaaagaga tattaatact 1560 
gacttccttc tggtagtact gcgtgatgtt gttcaggctt atcctgaagt tcgcattgtt 1620 
cttatgtctg ctactattga taccagcatg ttttgtgaat atttcttcaa ttgccccatc 1680 
attgaagttt atgggaggac ttacccagtt caagaatatt ttctggaaga ctgcattcag 1740 
atgacccact ttgttcctcc accaaaagac aaaaagaaga aggataagga tgatgatggt 1800 
ggtgaggatg atgatgcaaa ttgcaacttg atctgtggtg atgaatatgg tccagaaaca 1860 
aggttgagca tgtctcaatt gaacgaaaag gaaactcctt ttgaactcat cgaggctcta 1920 
cttaagtaca ttgaaaccct taatgttcct ggagctgtgt tggttttttt gcctggctgg 1980 
aatctgattt atactatgca gaagcatttg gaaatgaatc cacattttgg aagccatcgg 2040 
tatcagattc tacccctgca ttctcagatt cctcgagagg aacagcgcaa agtgtttgat 2100 
ccagtaccag ttggagtaac caaggttatt ttgtccacaa atattgctga aacaagcatt 2160 
accataaacg atgttgttta tgtcattgac tcctgcaagc agaaagtgaa actcttcact 2220 
gctcacaaca atatgaccaa ctattctacc gtatgggcat caaaaacaaa ccttgagcaa 228.0 
cggaaagggc gagctggccg . gagtacggct ggattctgct ttcacctgtg cagccgagct 2340 
cgttttgaga gacttgaaac ccacatgaca ccagagatgt tccgaacacc attgcatgaa 2400 
attgctctta gcataaaact tctgcgtcta ggaggaattg gccaatttct ggccaaagca 24 60 
attgaacctc cccctttgga tgctgtgatt gaagcagaac acactcttag agagcttgat 2520 
gcattagatg ccaatgatga gttgactcct ttgggacgaa tcctggctaa actccccatt 2580 
gagcctcgtt ttggcaaaat gatgataatg gggtgtattt tctacgtggg agatgctatc 2640 
tgtaccattg ctgctgctac ctgctttcca gagcctttca tcaatgaagg aaagcggctg 2700 
ggctatatcc atcgaaattt tgctggaaac agattttctg atcacgtagc ccttttatca 2760 
gtattccaag cctgggatga tgctagaatg ggtggagaag aagcagagat acgtttttgt 2820 
gagcacaaaa gacttaatat ggctacacta agaatgacct gggaagccaa agttcagctc 2880 
aaagagattt tgattaattc tgggtttcca gaagattgtt tgttgacaca agtgtttact 2940 
aacactggac cagataataa tttggatgtt gttatctccc tcctggcctt tggtgtgtac 3000 
cccaatgtat gctatcataa ggaaaagagg aagattctca ccactgaagg gcgtaatgca 3060 
cttatccaca aatcatctgt taattgtcct tttagtagcc aagacatgaa gtacccatct 3120 
cccttctttg tatttggtga aaagattcga actcgagcca tctctgctaa aggcatgact 3180 
ttagtacccc ccctgcagtt gcttctcttt gcctccaaga aagtccaatc tgatgggcag 3240 
attgtgcttg tagatgactg gattaaactg caaatatctc atgaagctgc tgcctgtatc 3300 
actggtctcc gggcagccat ggaggctttg gttgttgaag taaccaaaca acctgctatc 3360 
atcagccagt tggaccccgt aaatgaacgt atgctgaaca tgatccgtca gatctctaga 3420 
ccctcagctg ctggtatcaa ccttatgatt ggcagtacac ggtatggaga tggtccacgt 3480 
cctcccaaga tggcccgata cgacaatgga agcggatata gaaggggagg ttctagttac 3540 
agtggtggag gctatggcgg tggctatagc agtggaggct atggtagcgg aggctatggt 3600 
ggcagcgcca actcctttcg ggcaggatat ggtgcaggtg ttggtggagg ctatagagga 3660 
gtttcccgag gtggctttag aggcaactct ggaggagact acagagggcc tagtggaggc 3720 
tacagaggat ctgggggatt ccagcgagga ggtggtaggg gggcctatgg aactggctac 3780 
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tttggacagg gaagaggagg tggcggctat 3810 

<210> 3 
<211> 894 
<212> PRT 
<213> Homo sapien 



<400> 3 

Met Arg Pro Met Arg lie Phe Val Asn Asp Asp Arg His Val Met Ala 

15 10 15 

Lys His Ser Ser Val Tyr Pro Thr Gin Glu Glu Leu Glu Ala Val Gin 

20 25 30 

Asn Met Val Ser His Thr Glu Arg Ala Leu Lys Ala Val Ser Asp Trp 

35 40 45 

He Asp Glu Gin Glu Lys Gly Ser Ser Glu Gin Ala Glu Ser Asp Asn 

50 55 60 

Met Asp Val Pro Pro Glu Asp Asp Ser Lys Glu Gly Ala Gly Glu Gin 
65 70 75 80 

Lys Thr Glu His Met Thr Arg Thr Leu Arg Gly Val Met Arg Val Gly 

85 90 95 

Leu Val Ala Lys Cys Leu Leu Leu Lys Gly Asp Leu Asp Leu Glu Leu 

100 105 . 110 

Val Leu Leu Cys Lys Glu Lys Pro Thr Thr Ala Leu Leu Asp Lys Val 

115 120 125 

Ala Asp Asn Leu Ala He Gin Leu Ala Ala Val Thr Glu Asp Lys Tyr 

130 135 140 

Glu He Leu Gin Ser Val Asp Asp Ala Ala He Val He Lys Asn Thr 
145 150 155 160 

Lys Glu Pro Pro Leu Ser Leu Thr lie His Leu Thr Ser Pro Val Val 

165 170 175 

Arg Glu Glu Met Glu Lys Val Leu Ala Gly Glu Thr Leu Ser Val Asn 

180 185 190 

Asp Pro Pro Asp Val Leu Asp Arg Gin Lys Cys Leu Ala Ala Leu Ala 

195 200 205 

Ser Leu Arg His Ala Lys Trp Phe Gin Ala Arg Ala Asn Gly Leu Lys 

210 215 220 

Ser Cys Val He Val He Arg Val Leu Arg Asp Leu Cys Thr Arg Val 
225 230 235 240 

Pro Thr Trp Gly Pro Leu Arg Gly Trp Pro Leu Glu Leu Leu Cys Glu 

245 250 255 

Lys Ser He Gly Thr Ala Asn Arg Pro Met Gly Ala Gly Glu Ala Leu 

260 265 270 

Arg Arg Val Leu Glu Cys Leu Ala Ser Gly He Val Met Pro Asp Gly 
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275 

Ser Gly lie Tyr 
290 

His Leu Asp Arg 
305 

Ala Leu Arg Leu 

Asp Pro Leu Pro 
340 

Val Asp Tyr Thr 
355 

Pro Met Lys Arg 
370 

Lys Lys Lys Lys 
385 

Gin Ala Met Asn 

Gin Tyr Lys Leu 
420 

Thr Met Ser Val 
435 

Ser Lys Lys Thr 
450 

Met Gly Leu Pro 
465 

Asp Ser Ala Glu 

Pro Val Val Glu 
500 

Ala Thr Ala Glu 
515 

Val Met Glu Leu 
530 

Ser Glu Thr Gly 
545 

Val Asp Gly Gin 

Lys Ala Tyr Ala 
580 

Pro Leu Ala Leu 
595 

Arg Gly Gly Pro 
610 



280 285 
Asp Pro Cys Glu Lys Glu Ala Thr Asp Ala lie Gly 

295 300 
Gin Gin Arg Glu Asp He Thr Gin Ser Ala Gin His 
310 315 320 

Ala Ala Phe Gly Gin Leu His Lys Val Leu Gly Met 
325 330 335 

Ser Lys Met Pro Lys Lys Pro Lys Asn Glu Asn Pro 

345 350 
Val Gin He Pro Pro Ser Thr Thr Tyr Ala He Thr 

360 365 
Pro Met Glu Glu Asp Gly Glu Glu Lys Ser Pro Ser 

375 380 
Lys He Gin Lys Lys Glu Glu Lys Ala Glu Pro Pro 
390 395 400 

Ala Leu Met Arg Leu Asn Gin Leu Lys Pro Gly Leu 
405 410 415 

Val Ser Gin Thr Gly Pro Val His Ala Pro He Phe 

425 430 
Glu Val Asp Gly Asn Ser Phe Glu Ala Ser Gly Pro 

440 445 
Ala Lys Leu His Val Ala Val Lys Val Leu Gin Asp 

455 460 
Thr Gly Ala Glu Gly Arg Asp Ser Ser Lys Gly Glu 
470 475 480 

Glu Thr Glu Ala Lys Pro Ala Val Val Ala Pro Ala 
485 490 495 

Ala Val Ser Thr Pro Ser Ala Ala Phe Pro Ser Asp 

505 510 
Gin Gly Pro He Leu Thr Lys His Gly Lys Asn Pro 

520 525 
Asn Glu Lys Arg Arg Gly Leu Lys Tyr Glu Leu He 

535 540 
Gly Ser His Asp Lys Arg Phe Val Met Glu Val Glu 
550 555 560 

Lys Phe Gin Gly Ala Gly Ser Asn Lys Lys Val Ala 
565 570 575 

Ala Leu Ala Ala Leu Glu Lys Leu Phe Pro Asp Thr 

585 590 
Asp Ala Asn Lys Lys Lys Arg Ala Pro Val Pro Val 

600 605 
Lys Phe Ala Ala Lys Pro His Asn Pro Gly Phe Gly 
615 620 
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Met Gly Gly Pro Met His Asn Glu Val Pro Pro Pro Pro Asn Leu Arg 
625 630 635 1 640 

Gly Arg Gly Arg Gly Gly Ser He Arg Gly Arg Gly Arg Gly Arg Gly 

645 650 655 

Phe Gly Gly Ala Asn His Gly Gly Tyr Met Asn Ala Gly Ala Gly Tyr 

660 665 670 

Gly Ser Tyr Gly Tyr Gly Gly Asn Ser Ala Thr Ala Gly Tyr Ser Gin 

675 680 685 

Phe Tyr Ser Asn Gly Gly His Ser Gly Asn Ala Ser Gly Gly Gly Gly 

690 695 700 

Gly Gly Gly Gly Gly Ser Ser Gly Tyr Gly Ser Tyr Tyr Gin Gly Asp 
705 710 715 720 

Asn Tyr Asn Ser Pro Val Pro Pro Lys His Ala Gly Lys Lys Gin Pro 

725 730 735 

His Gly Gly Gin Gin Lys Pro Ser Tyr Gly Ser Gly Tyr Gin Ser His 

740 745 750 

Gin Gly Gin Gin Gin Ser Tyr Asn Gin Ser Pro Tyr Ser Asn Tyr Gly 

755 760 765 

Pro Pro. Gin Gly Lys Gin Lys Gly Tyr Asn His Gly Gin Gly Ser Tyr 

770 775 780 

Ser Tyr Ser Asn Ser Tyr Asn Ser Pro Gly Gly Gly Gly Gly Ser Asp 
785 790 795 800 

Tyr Asn Tyr Glu Ser Lys Phe Asn Tyr Ser Gly Ser Gly Gly Arg Ser 

805 810 815 

Gly Gly Asn Ser Tyr Gly Ser Gly Gly Ala Ser Tyr Asn Pro Gly Ser 

820 825 830 

His Gly Gly Tyr Gly Gly Gly Ser Gly Gly Gly Ser Ser Tyr Gin Gly 

835 840 845 

Lys Gin Gly Gly Tyr Ser Gin Ser Asn Tyr Asn Ser Pro Gly Ser Gly 

850 855 860 

Gin Asn Tyr Ser Gly Pro Pro Ser Ser Tyr Gin Ser Ser Gin Gly Gly 
865 870 875 880 

Tyr Gly Arg Asn Ala Asp His Ser Met Asn Tyr Gin Tyr Arg 
885 890 



<210> 4 
<211> 2685 
<212> DNA 
<213> Homo sapien 

<400> 4 

atgcgtccaa tgcgaatttt tgtgaatgat gaccgccatg tgatggcaaa gcattcttcc 60 
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gtttatccaa cacaagagga gctggaggca 
gcgctcaaag ctgtgtccga ctggatagac 
gagtccgata acatggatgt gcccccagag 
aagacggagc acatgaccag aaccctgcgg 
tgcctcctac tcaaggggga cttggatctg 
acaaccgccc tcctggacaa ggtggccgac 
gaagacaagt acgaaatact gcaatctgtc 
aaagagcctc cattgtccct gaccatccac 
gagaaagtat tagctggaga aacgctatca 
cagaaatgcc ttgctgcctt ggcgtccctc 
aacgggctga agtcttgtgt cattgtgatc 
cccacctggg gtcccctccg aggctggcct 
acggccaaca gaccgatggg tgctggcgag 
tcgggcatcg tgatgccaga tggttctggc 
gatgctattg ggcatctaga cagacagcaa 
gcactgcggc tcgctgcctt cggccagctc 
tccaagatgc ccaagaaacc aaagaatgaa 
ccaagcacca cctatgccat tacgcccatg 
aagtcgccca gcaaaaagaa gaagaagatt 
caggctatga atgccctgat gcggttgaac 
gtgtcccaga ctgggcccgt ccatgccccc 
aattcattcg aggcctctgg gccctccaaa 
gtgttacagg acatgggctt gccgacgggt 
gactcggctg aggagaccga ggcgaagcca 
gctgtctcca cccctagtgc ggcctttccc 
ctgacaaagc acggcaagaa cccagtcatg 
tacgagctca tctccgagac cgggggcagc 
gtggatggac agaagttcca aggtgctggt 
gctcttgctg ccctagaaaa gcttttccct 
aagaagagag ccccagtacc cgtcagaggg 
cctggcttcg gcatgggagg ccccatgcac 
gggcggggaa gaggcgggag catccgggga 
aaccatggag gctacatgaa tgccggtgct 
tctgcgacag caggctacag tcagttctac 
ggcggtggcg gcgggggcgg tggtggctcc 
aactacaact caccggtgcc cccaaaacac 
cagaagccct cctacggctc gggctaccag 
cagagcccct acagcaacta tggccctcca 
caaggcagct actcctactc gaactcctac 
tacaactacg agagcaaatt caactacagt 
tacggctcag gcggggcatc ctacaaccca 
gggggcggct cctcatacca aggcaaacaa 
ccggggtccg gccagaacta cagtggccct 



g tccagaaca 


tacrtatccca 




120 


aaacaacraaa 
y ay v<ay yoofl 


y y y ay way 


poaara a err* a. 
^yoy^oyy^itt 


XOKJ 


rrarns r*arrt* a 


adyaayyy yu 


4- rr effort a a pa n 

l y y y y aauay 






gyy uyyyuu-L 


yy Lyy^oaay 


JUL* 


yd.yu>LyyLyu 


ctf aa 
uytLy uy taa 


ftfrarra a ct f /"^ /~> 


JDU 


aap r*i" n nc* r* a 


LLuay^Luyo 


trrrfftfaapa 
Ly^Ly uaaud 




yatya l y uiy 


u.y a L Ly Lyav. 


aaaaaar*ar>a 


4P0 
ft 0 U 




uuy l uy LL.ay 


ay dayaoa uy 


r A n 

D ft U 


<rt~ r*^a a o rra r*f* 
y LLaatyduL 


orppn/ra /■* rT4~ 
U.L>u.L,y y aL»y u 


uuLyyd.ca.gg 


con 


uyouci^^^ua 


^ n t* rr ft t* +• r» r» a 
ay ^»y y *- u^^a 


rr rr p p a rTa etc* 
yyLLdyayuL 


DDU 


uy y y u c c. uy a. 


yyya.OL.ugty 


cdctcy cgug 




ut^yayctuu 


uy ty Lyayaa 


auCCaLuyyC 




y ccl, uy cyya 


gag ugc ugga 


g ugee uggcg 


ft a n 

O fl u 


attcatgacc 


c u ugugaaaa 


agaagecact 


ann 

yuu 


cgggaaga i-a 


t cacacagag 


tgcgcagcac 


you 


cataaa gtcc 


t aggcatgga 


ccctctgcct 




aacccagtgg 


actacaccgt 


tcagatccca 


1 f\ O f\ 

lUoU 


aaacgcccaa 


tggaggagga 


eggggaggag 


I 1 in 

II 4*0 


cagaagaaag 


aggagaaggc 


agagcccccc 


IzUU 


cagctgaagc 


cagggctgca 


gtacaagctg 




atctttacca 


tgtctgtgga 


ggttgatggc 


1 320 


aagacggcca 


agctgcacgt 


ggccgttaag 


1 JoO 


gctgaaggca 


gggactcgag 


caagggggag 


1 A A f\ 

1440 


gcagugg ugg 


cccctgcccc 


agtggtagaa 




tcagat gcca 


ctgccgagca 


ggggecgate 


J.3D0 


gagctgaacg 


agaagaggcg 


tgggctcaag 


icon 


cacgacaagc 


gcttcgtcat 


ggaggtcgaa 


icon 


tccaacaaaa 


aggtggcgaa 


ggcctacgct 


i *7 x n 
1 /40 


gacacccctc 


tcgccct tga 


tgccaacaaa 


loOO 


ggaccgaaat 


uugcugctaa 


gccacataac 


1860 


aacgaagt gc 


ccccaccccc 


caaccttcga 


1920 


cgagggcgcg 


ggcgaggatt 


tggtggcgcc 


1980 


gggtatggaa 


gctatgggta 


eggaggcaac 


2040 


agcaacggag 


ggcattctgg 


gaatgccagt 


2100 


tccggctatg 


gctcctacta 


ccaaggtgac 


2160 


gctgggaaga 


agcagccgca 


egggggecag 


2220 


tcccaccagg 


gccagcagca 


gtcctacaac 


2280 


cagggcaagc 


agaaaggcta 


taaccatgga 


2340 


aactctcccg 


ggggcggggg 


cggatccgac 


2400 


ggtagtggag 


gccgaagcgg 


egggaacage 


2460 


gggtcacacg 


ggggctacgg 


eggaggttet 


2520 


ggaggctact 


cacagtcgaa 


ctacaactcc 


2580 


cccagctcct 


accagtcctc 


acaaggegge 


2640 
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tatggcagaa acgcagacca cagcatgaac taccagtaca gataa 26 

<210> 5 

<211> 702 

<212> PRT 

<213> Homo sapien 

<400> 5 

Met Arg Pro Met Arg He Phe Val Asn Asp Asp Arg His Val Met Ala 

1 5 10 15 

Lys His Ser Ser Val Tyr Pro Thr Gin Glu Glu Leu Glu Ala Val Gin 

20 25 30 

Asn Met Val Ser His Thr Glu Arg Ala Leu Lys Ala Val Ser Asp Trp 

35 40 45 

He Asp Glu Gin Glu Lys Gly Ser Ser Glu Gin Ala Glu Ser Asp Asn 

50 55 60 

Met Asp Val Pro Pro Glu Asp Asp Ser Lys Glu Gly Ala Gly Glu Gin 
65 70 75 80 

Lys Thr Glu His Met Thr Arg Thr Leu Arg Gly Val Met Arg Val Gly 

85 90 95 

Leu Val Ala Lys Cys Leu Leu Leu Lys Gly Asp Leu Asp Leu Glu Leu 

100 105 110 

Val Leu Leu Cys Lys Glu Lys Pro Thr Thr Ala Leu Leu Asp Lys Val 

115 120 125 

Ala Asp Asn Leu Ala He Gin Leu Ala Ala Val Thr Glu Asp Lys Tyr 

130 135 140 

Glu He Leu Gin Ser Val Asp Asp Ala Ala He Val He Lys Asn Thr 
145 150 155 160 

Lys Glu Pro Pro Leu Ser Leu Thr He His Leu Thr Ser Pro Val Val 

165 170 175 

Arg Glu Glu Met Glu Lys Val Leu Ala Gly Glu Thr Leu Ser Val Asn 

180 185 190 

Asp Pro Pro Asp Val Leu Asp Arg Gin Lys Cys Leu Ala Ala Leu Ala 

195 200 205 

Ser Leu Arg His Ala Lys Trp Phe Gin Ala Arg Ala Asn Gly Leu Lys 

210 215 220 

Ser Cys Val He Val He Arg Val Leu Arg Asp Leu Cys Thr Arg Val 
225 230 235 240 

Pro Thr Trp Gly Pro Leu Arg Gly Trp Pro Leu Glu Leu Leu Cys Glu 

245 250 255 

Lys Ser He Gly Thr Ala Asn Arg Pro Met Gly Ala Gly Glu Ala Leu 

260 265 270 

Arg Arg Val Leu Glu Cys Leu Ala Ser' Gly He Val Met Pro Asp Gly 
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275 280 
Ser Gly lie Tyr Asp Pro Cys Glu Lys Glu Ala Thr 
290 295 300 

His Leu Asp Arg Gin Gin Arg Glu Asp He Thr Gin 
305 310 315 

Ala Leu Arg Leu Ala Ala Phe Gly Gin Leu His Lys 

325 330 
Asp Pro Leu Pro Ser Lys Met Pro Lys Lys Pro Lys 

340 345 
Val Asp Tyr Thr Val Gin He Pro Pro Ser Thr Thr 

355 360 
Pro Met Lys Arg Pro Met Glu Glu Asp Gly Glu Glu 
370 375 380 

Lys Lys Lys Lys Lys He Gin Lys Lys Glu Glu Lys 
385 390 395 

Gin Ala Met Asn Ala Leu Met Arg Leu Asn Gin Leu 

405 410 
Gin Tyr Lys Leu Val Ser Gin Thr Gly Pro Val His 

420 425 
Thr Met Ser Val Glu Val Asp Gly Asn Ser Phe Glu 

435 440 
Ser Lys Lys Thr Ala Lys Leu His Val Ala Val Lys 
450 455 460 

Met Gly Leu Pro Thr Gly Ala Glu Gly Arg Asp Ser 
465 470 475 

Asp Ser Ala Glu Glu Thr Glu Ala Lys Pro Ala Val 

485 490 
Pro Val Val Glu Ala Val Ser Thr Pro Ser Ala Ala 

500 505 
Ala Thr Ala Glu Gin Gly Pro He Leu Thr Lys His 

515 520 
Val Met Glu Leu Asn Glu Lys Arg Arg Gly Leu Lys 
530 535 540 

Ser Glu Thr Gly Gly Ser His Asp Lys Arg Phe Val 
545 550 555 

Val Asp Gly Gin Lys Phe Gin Gly Ala Gly Ser Asn 

565 570 
Lys Ala Tyr Ala Ala Leu Ala Ala Leu Glu Lys Leu 

580 585 
Pro Leu Ala Leu Asp Ala Asn Lys Lys Lys Arg Ala 

595 600 
Arg Gly Gly Pro Lys Phe Ala Ala Lys Pro His Asn 
610 615 620 



285 

Asp Ala He Gly 

Ser Ala Gin His 
320 

Val Leu Gly Met 
335 

Asn Glu Asn Pro 
350 

Tyr Ala He Thr 
365 

Lys Ser Pro Ser 

Ala Glu Pro Pro 
400 

Lys Pro Gly Leu 
415 

Ala Pro He Phe 
430 

Ala Ser Gly Pro 
445 

Val Leu Gin Asp 

Ser Lys Gly Glu 
480 

Val Ala Pro Ala 
495 

Phe Pro Ser Asp 
510 

Gly Lys Asn Pro 
525 

Tyr Glu Leu He 

Met Glu Val Glu 
560 

Lys Lys Val Ala 
575 

Phe Pro Asp Thr 
590 

Pro Val Pro Val 
605 

Pro Gly Phe Gly 
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Met Gly Gly Pro Met His Asn Glu 
625 630 
Gly Arg Gly Arg Gly Gly Ser lie 
645 

Phe Gly Gly Ala Asn His Gly Gly 
660 

Gly Ser Tyr Gly Tyr Gly Gly Asn 
675 680 
Phe Phe Thr Asp Cys Tyr Gly Tyr 
690 695 



Val Pro Pro Pro Pro Asn Leu Arg 
635 640 
Arg Gly Arg Gly Arg Gly Arg Gly 

650 655 
Tyr Met Asn Ala Gly Ala Gly Tyr 
665 670 
Ser Ala Thr Ala Gly Tyr Ser Asp 
685 

His Asp Phe Gly Ser Ser 
700 



<210> 6 
<211> 2107 
<212> DNA 
<213> Homo sapien 

<400> 6 

atgcgtccaa tgcgaatttt tgtgaatgat 
gtttatccaa cacaagagga gctggaggca 
gcgctcaaag ctgtgtccga ctggatagac 
gagtccgata acatggatgt gcccccagag 
aagacggagc acatgaccag aaccctgcgg 
tgcctcctac tcaaggggga cttggatctg 
acaaccgccc tcctggacaa ggtggccgac 
gaagacaagt acgaaatact gcaatctgtc 
aaagagcctc cattgtccct gaccatccac 
gagaaagtat tagctggaga aacgctatca 
cagaaatgcc ttgctgcctt ggcgtccctc 
aacgggctga agtcttgtgt cattgtgatc 
cccacctggg gtcccctccg aggctggcct 
acggccaaca gaccgatggg tgctggcgag 
tcgggcatcg tgatgccaga tggttctggc 
gatgctattg ggcatctaga cagacagcaa 
gcactgcggc tcgctgcctt cggccagctc 
tccaagatgc ccaagaaacc aaagaatgaa 
ccaagcacca cctatgccat tacgcccatg 
aagtcgccca gcaaaaagaa gaagaagatt 
caggctatga atgccctgat gcggttgaac 
gtgtcccaga ctgggcccgt ccatgccccc 
aattcattcg aggcctctgg gccctccaaa 
gtgttacagg acatgggctt gccgacgggt 
gactcggctg aggagaccga ggcgaagcca 



gaccgccatg 


tgatggcaaa 


gcattcttcc 


60 


gtccagaaca 


tggtgtccca 


cacggagcgg 


120 


gagcaggaaa 


agggtagcag 


cgagcaggca 


180 


gacgacagta 


aagaaggggc 


tggggaacag 


240 


ggagtgatgc 


gggtgggcct 


ggtggcaaag 


300 


gagctggtgc 


tgctgtgtaa 


ggagaagccc 


360 


aacctggcca 


tccagcttgc 


tgctgtaaca 


420 


gacgatgctg 


cgattgtgat 


aaaaaacaca 


480 


ctgacatccc 


ctgttgtcag 


agaagaaatg 


540 


gtcaacgacc 


ccccggacgt 


tctggacagg 


600 


cgacacgcca 


agtggttcca 


ggccagagcc 


660 


cgggtcttga 


gggacctgtg 


cactcgcgtg 


720 


ctcgagctcc 


tgtgtgagaa 


atccattggc 


780 


gccctgcgga 


gagtgctgga 


gtgcctggcg 


840 


atttatgacc 


cttgtgaaaa 


agaagccact 


900 


cgggaagata 


tcacacagag 


tgcgcagcac 


960 


cataaagtcc 


taggcatgga 


ccctctgcct 


1020 


aacccagtgg 


actacaccgt 


tcagatccca 


1080 


aaacgcccaa 


tggaggagga 


cggggaggag 


1140 


cagaagaaag 


aggagaaggc 


agagcccccc 


1200 


cagctgaagc 


cagggctgca 


gtacaagctg 


1260 


atctttacca 


tgtctgtgga 


ggttgatggc 


1320 


aagacggcca 


agctgcacgt 


ggccgttaag 


1380 


gctgaaggca 


gggactcgag 


caagggggag 


1440 


gcagtggtgg 


cccctgcccc 


agtggtagaa 


1500 
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gctgtctcca cccctagtgc ggcctttccc 
ctgacaaagc acggcaagaa cccagtcatg 
tacgagctca tctccgagac cgggggcagc 
gtggatggac agaagttcca aggtgctggt 
gctcttgctg ccctagaaaa gcttttccct 
gaagagagcc ccagtacccg tcagaggggg 
tggcttcggc atgggaggcc ccatgcacaa 
gcggggaaga ggcgggagca tccggggacg 
ccatggaggc tacatgaatg ccggtgctgg 
tgcgacagca ggctacagtg actttttcac 
ttcctag 



tcagatgcca 


ctgccgagca 


ggggccgatc 


1560 


gagctgaacg 


agaagaggcg 


tgggctcaag 


1620 


cacgacaagc 


gcttcgtcat 


ggaggtcgaa 


1680 


tccaacaaaa 


aggtggcgaa 


ggcctacgct 


1740 


gacacccctc 


gcccttgatg 


ccaacaaaaa 


1800 


accgaaattt 


gctgctaagc 


cacataaccc 


1860 


cgaagtgccc 


ccacccccca 


accttcgagg 


1920 


agggcgcggg 


cgaggatttg 


gtggcgccaa 


1980 


gtatggaagc 


tatgggtacg 


gaggcaactc 


2040 


agactgctac 


ggctatcatg 


attttgggtc 


2100 
2107 



<210> 7 
<211> 406 
<212> PRT 
<213> Homo sapien 

<400> 7 

Met Arg Gly Asp Arg'Gly Arg Gly Arg Gly Gly Arg Phe Gly Ser Arg 

15 10 15 

Gly Gly Pro Gly Gly Gly Phe Arg Pro Phe Val Pro His He Pro Phe 

20 25 30 

Asp Phe Tyr Leu Cys Glu Met Ala Phe Pro Arg Val Lys Pro Ala Pro 

35 40 45 

Asp Glu Thr Ser Phe Ser Glu Ala Leu Leu Lys Arg Asn Gin Asp Leu 

50 55 60 

Ala Pro Asn Ser Ala Glu Gin Ala Ser He Leu Ser Leu Val Thr Lys 
65 70 75 80 

He Asn Asn Val He Asp Asn Leu He Val Ala Pro Gly Thr Phe Glu 

85 90 95 

Val Gin He Glu Glu Val Arg Gin Val Gly Ser Tyr Lys Lys Gly Thr 

100 105 110 

Met Thr Thr Gly His Asn Val Ala Asp Leu Val Val He Leu Lys He 

115 120 125 

Leu Pro Thr Leu Glu Ala Val Ala Ala Leu Gly Asn Lys Val Val Glu 

130 135 140 

Ser Leu Arg Ala Gin Asp Pro Ser Glu Val Leu Thr Met Leu Thr Asn 
145 150 155 160 

Glu Thr Gly Phe Glu He Ser Ser Ser Asp Ala Thr Val Lys He Leu 

165 170 175 

He Thr Thr Val Pro Pro Asn Leu Arg Lys Leu Asp Pro Glu Leu His 

180 185 190 

teu Asp He Lys Val Leu Gin Ser Ala Leu Ala Ala He Arg His Ala 
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195 

Arg Trp Phe Glu Glu 
210 

Arg Leu Leu Lys Asp 
225 

Thr Pro Trp He Leu 
245 

Pro Thr Arg Gin Pro 
260 

Gin He Leu Ala Ala 
275 

Asp Pro Cys Glu Ser 
290 

Glu Gin Gin Asp Met 
305 

Leu Ser His Gly Gly 
325 

Ser Tyr Leu Ala Ser 
340 

Pro Ser Glu Lys Ala 
355 

Glu Glu Glu Glu Asn 
370 

Lys His Gly Asn Ser 
385 

Lys Gly Lys Thr Gly 
405 



200 

Asn Ala Ser Gin 
215 

Leu Arg He Arg 
230 

Asp Leu Leu Gly 

Leu Ala Leu Asn 
265 

Gly Leu Phe Leu 
280 

Gly Asn Phe Arg 
295 

Val Cys Tyr Thr 
310 

Phe Arg Lys He 

Glu He Ser Thr 
345 

Tyr Glu Lys Pro 
360 

Thr Glu Arg Thr 
375 

Gly Val Thr Phe 

390 

Ala 



205 

Ser Thr Val Lys 
220 

Phe Pro Gly Phe 
235 

His Tyr Ala Val 
250 

Val Ala Tyr Arg 

Pro Gly Ser Val 
285 

Val His Thr Val 
300 

Ala Gin Thr Leu 
315 

Leu Gly Gin Glu 
330 

Trp Asp Gly Val 

Pro Glu Lys Lys 
365 

Thr Ser Arg Arg 
380 

Pro Ser Leu Leu 
395 



Val Leu He 

Glu Pro Leu 
240 

Met Asn Asn 
255 

Arg Cys Leu 
270 

Gly He Thr 

Met Thr Leu 

Val Arg He 
320 

Gly Asp Ala 

335 
He Val Thr 
350 

Glu Gly Glu 

Gly Arg Arg 

Phe Leu Pro 
400 



<210> 8 
<211> 1221 
<212> DNA 
<213> Homo sapien 

<400> 8 

atgaggggtg acagaggccg tggtcgtggt 
ggagggttca ggccctttgt accacatatc 
tttccccggg tcaagccagc acctgatgag 
aaccaggacc tggctcccaa ttctgctgaa 
ataaacaatg tgattgataa tctgattgtg 
gaagttcgac aggtgggatc ctataaaaag 
gacctggtgg tgatactcaa gattctgcca 
aaagtcgtgg aaagcctaag agcacaggat 



gggcgctttg gttccagagg aggcccagga 60 

ccatttgact tctatttgtg tgaaatggcc 120 

acttccttca gtgaggcctt gctgaagagg 180 

caggcatcta tcctttctct agtgacaaaa 240 

gctccaggga catttgaagt gcaaattgaa 300 

gggacaatga ctacaggaca caatgtggct 360 

acgttggaag ctgttgctgc cctggggaac 420 

ccttctgaag ttttaaccat gctgaccaac 480 
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gaaacaggct ttgaaatcag ttcttctgat 
ccacccaatc ttcgaaaact ggatccagaa 
gccttagcag ccatccgaca tgcccgctgg 
aaagttctca tcagactact gaaggacttg 
acaccctgga tccttgacct actaggccat 
cctttggccc taaacgttgc atacaggcgc 
ctgccaggtt cagtgggtat cactgacccc 
gtcatgaccc tagaacagca ggacatggtc 
ctctcacatg gtggctttag gaagatcctt 
tctgaaatat ctacctggga tggagtgata 
ccaccagaga agaaggaagg agaggaagaa 
agaggaagaa gaaagcatgg aaactcagga 
aagggaaaga ctggagccta a 



gctacagtga agattctcat tacaacagtg 540 
ctccatttgg atatcaaagt attgcagagt 600 
ttcgaggaaa atgcttctca gtccacagtt 660 
aggattcgtt ttcccggctt tgagcccctc 720 
tatgctgtga tgaacaaccc caccagacag 780 
tgcttgcaga ttctggctgc aggactgttc 840 
tgtgagagtg gcaactttag agtacacaca 900 
tgctatacag ctcagactct cgtccgaatc 960 
ggccaggagg gtgatgccag ctatcttgct 1020 
gtaacacctt cagaaaaggc ttatgagaag 1080 
gaggagaata cagaaagaac cacctcaagg 1140 
gtgacattcc cttcactcct tttcctaccc 1200 

1221 
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